2022 Vol.41, Issue 3

Research Article

31 May 2022. pp. 326-334
References
1
T. Virtanen, R. Singh, and B. Raj, Techniques for Noise Robustness in Automatic Speech Recognition (John Wiley & Sons, New York, 2012), pp. 109-154. 10.1002/9781118392683
2
M. Wölfel and J. McDonough, Distant Speech Recognition (John Wiley & Sons, New York, 2009), pp. 387-491.
3
J. Droppo and A. Acero, Environmental Robustness (Springer, Heidelberg, 2008), pp. 653-680. 10.1007/978-3-540-49127-9_33
4
M. Kim and H.-M. Park, "Efficient online target speech extraction using DOA-constrained independent component analysis of stereo data for robust speech recognition," Signal Processing, 117, 126-137 (2015). 10.1016/j.sigpro.2015.04.022
5
L. Albera, "Independent component analysis and applications," Handbook of Blind Source Separation: Independent Component Analysis and Applications, edited by P. Comon and C. Jutten (Academic Press, Kidlington, 2010).
6
S. Haykin, Unsupervised Adaptive Filtering, volume 1: Blind Source Separation (John Wiley & Sons, New York, 2000), pp. 238-258.
7
A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis and Blind Source Separation (John Wiley & Sons, New York, 2001), pp. 4-42. 10.1002/0471221317
8
Y. Takahashi, T. Takatani, K. Osako, H. Saruwatari, and K. Shikano, "Blind spatial subtraction array for speech enhancement in noisy environment," IEEE Transactions on Audio, Speech, and Language Processing, 17, 650-664 (2009). 10.1109/TASL.2008.2011517
9
F. Nesta and M. Matassoni, "Blind source extraction for robust speech recognition in multisource noisy environments," Computer Speech and Language, 27, 703-725 (2013). 10.1016/j.csl.2012.08.001
10
M. El Rhabi, H. Fenniri, A. Keziou, and E. Moreau, "A robust algorithm for convolutive blind source separation in presence of noise," Signal Processing, 93, 818-827 (2013). 10.1016/j.sigpro.2012.09.026
11
T. Kim, H. T. Attias, S.-Y. Lee, and T.-W. Lee, "Blind source separation exploiting higher-order frequency dependencies," IEEE Transactions on Audio, Speech, and Language Processing, 15, 70-79 (2007). 10.1109/TASL.2006.872618
12
T. Kim, "Real-time independent vector analysis for convolutive blind source separation," IEEE Transactions on Circuits and Systems I: Regular Papers, 57, 1431-1438 (2010). 10.1109/TCSI.2010.2048777
13
M. Oh and H.-M. Park, "Blind source separation based on independent vector analysis using feed-forward network," Neurocomputing, 74, 3713-3715 (2011). 10.1016/j.neucom.2011.06.008
14
I. Lee, G.-J. Jang, and T.-W. Lee, "Independent vector analysis using densities represented by chain-like overlapped cliques in graphical models for separation of convolutedly mixed signals," Electronics Letters, 45, 710-711 (2009). 10.1049/el.2009.0945
15
C.-H. Choi, W. Chang, and S.-Y. Lee, "Blind source separation of speech and music signals using harmonic frequency dependent independent vector analysis," Electronics Letters, 48, 124-125 (2012). 10.1049/el.2011.3215
16
N. Ono, "Stable and fast update rules for independent vector analysis based on auxiliary function technique," Proc. IEEE WASPAA, 189-192 (2011). 10.1109/ASPAA.2011.6082320
17
N. Ono, "Auxiliary-function-based independent vector analysis with power of vector-norm type weighting functions," Proc. APSIPA, 1-4 (2012).
18
D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, 401, 788 (1999). 10.1038/44565
19
D. D. Lee and H. S. Seung, "Algorithms for non-negative matrix factorization," Advances in Neural Information Processing Systems, 13, 556-562 (2001).
20
D. Kitamura, N. Ono, H. Sawada, H. Kameoka, and H. Saruwatari, "Efficient multichannel nonnegative matrix factorization exploiting rank-1 spatial model," Proc. IEEE ICASSP, 276-280 (2015). 10.1109/ICASSP.2015.7177975
21
D. Kitamura, N. Ono, H. Sawada, H. Kameoka, and H. Saruwatari, "Determined blind source separation unifying independent vector analysis and nonnegative matrix factorization," IEEE/ACM TASLP, 24, 1622-1637 (2016). 10.1109/TASLP.2016.2577880
22
U.-H. Shin and H.-M. Park, "Auxiliary-function-based independent vector analysis using generalized inter-clique dependence source models with clique variance estimation," IEEE Access, 8, 68103-68113 (2020). 10.1109/ACCESS.2020.2985842
23
A. R. López, N. Ono, U. Remes, K. Palomäki, and M. Kurimo, "Designing multichannel source separation based on single-channel source separation," Proc. IEEE ICASSP, 469-473 (2015).
24
Z. Koldovský, P. Tichavský, and V. Kautský, "Orthogonally constrained independent component extraction: Blind MPDR beamforming," Proc. EUSIPCO, 1155-1159 (2017). 10.23919/EUSIPCO.2017.8081389
25
T. Kounovský, Z. Koldovský, and J. Čmejla, "Recursive and partially supervised algorithms for speech enhancement on the basis of independent vector extraction," Proc. IWAENC, 401-405 (2018). 10.1109/IWAENC.2018.8521399
26
J.-F. Cardoso, "Multidimensional independent component analysis," Proc. IEEE ICASSP, 4, 1941-1944 (1998).
27
D. FitzGerald, M. Cranitch, and E. Coyle, "Non-negative tensor factorisation for sound source separation," Proc. Irish Signals and Systems Conf., 8-12 (2005). 10.1049/cp:20050279
28
J. Heymann, L. Drude, and R. Haeb-Umbach, "Neural network based spectral mask estimation for acoustic beamforming," Proc. IEEE ICASSP, 196-200 (2016). 10.1109/ICASSP.2016.7471664
29
A. Schwarz and W. Kellermann, "Coherent-to-diffuse power ratio estimation for dereverberation," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23, 1006-1018 (2015). 10.1109/TASLP.2015.2418571
30
R. Lee, M.-S. Kang, B.-H. Kim, K.-H. Park, S. Q. Lee, and H.-M. Park, "Sound source localization based on GCC-PHAT with diffuseness mask in noisy and reverberant environments," IEEE Access, 8, 7373-7382 (2020). 10.1109/ACCESS.2019.2963768
31
J. Caroselli, I. Shafran, A. Narayanan, and R. Rose, "Adaptive multichannel dereverberation for automatic speech recognition," Proc. Interspeech, 3877-3881 (2017). 10.21437/Interspeech.2017-1791
32
B. J. Cho, J.-M. Lee, and H.-M. Park, "A beamforming algorithm based on maximum likelihood of a complex Gaussian distribution with time-varying variances for robust speech recognition," IEEE Signal Processing Letters, 26, 1398-1402 (2019). 10.1109/LSP.2019.2932848
33
E. Vincent, S. Watanabe, A. A. Nugraha, J. Barker, and R. Marxer, "An analysis of environment, microphone and data simulation mismatches in robust speech recognition," Computer Speech & Language, 46, 535-557 (2017). 10.1016/j.csl.2016.11.005
34
J. Barker, R. Marxer, E. Vincent, and S. Watanabe, "The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines," Proc. IEEE Workshop on ASRU, 504-511 (2015). 10.1109/ASRU.2015.7404837
35
T. Higuchi, N. Ito, T. Yoshioka, and T. Nakatani, "Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise," Proc. IEEE ICASSP, 5210-5214 (2016). 10.1109/ICASSP.2016.7472671
36
O. L. Frost, "An algorithm for linearly constrained adaptive array processing," Proceedings of the IEEE, 60, 926-935 (1972). 10.1109/PROC.1972.8817
Information
  • Publisher: The Acoustical Society of Korea
  • Publisher (Ko): 한국음향학회
  • Journal Title: The Journal of the Acoustical Society of Korea
  • Journal Title (Ko): 한국음향학회지
  • Volume: 41
  • No: 3
  • Pages: 326-334
  • Received Date: 2022-03-21
  • Revised Date: 2022-05-03
  • Accepted Date: 2022-05-11