All Issue

2022 Vol.41, Issue 2 Preview Page

Research Article

31 March 2022. pp. 122-129
Abstract
References
1
T. Ko, V. Peddinti, D. Povey, and S. Khudanpur, "Audio augmentation for speech recognition," Proc. Interspeech, 3586­3589 (2015). 10.21437/Interspeech.2015-711
2
D. S. Park, W. Chan, Y. Zhang, C.-C. Chiu, B. Zoph, E. D. Cubuk, and Q. V. Le, "SpecAugment: A simple data augmentation method for automatic speech recognition," arXiv:1904.08779 (2019). 10.21437/Interspeech.2019-2680
3
X. Song, Z. Wu, Y. Huang, D. Su, and H. Meng, "SpecSwap: A simple data augmentation method for end­to­end speech recognition," Proc. Interspeech, 581­ 585 (2020). 10.21437/Interspeech.2020-2275
4
D. B. Paul and J. M. Baker, "The design for the wall street journal­based CSR corpus," Proc. Speech and Natural Language Workshop, 357-362 (1992). 10.3115/1075527.1075614
5
V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, "LibriSpeech: An ASR corpus based on public domain audio books," Proc. ICASSP. 5206­5210 (2015). 10.1109/ICASSP.2015.7178964
6
W. Chan, N. Jaitly, Q. V. Le, and O. Vinyals, "Listen, attend and spell," arXiv:1508.01211 (2018).
7
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, "Attention is all you need," Proc. NIPS. 5998­6008 (2017).
8
A. Graves, S. Fernandez, F. Gomez, and J. Schmidhuber, "Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks," Proc. ICML. 369­376 (2006). 10.1145/1143844.1143891
9
S. Kim, T. Hori, and S. Watanabe, "Joint CTC­attention based end­to­end speech recognition using multi­task learning," Proc. ICASSP. 4835­4839 (2017). 10.1109/ICASSP.2017.7953075
10
Sox, Audio Manipulation Tool, http://sox.sourceforge.net/ , (Last viewed March 25, 2015).
11
S. Watanabe, T. Hori, S. Karita, T. Hayashi, J. Nishitoba, Y. Unno, N. E. Y. Soplin, J. Heymann, M. Wiesner, N. Chen, A. Renduchintala, and T. Ochiai, "ESPnet: End-to-end speech processing toolkit," arXiv: 1804.00015 (2018). 10.21437/Interspeech.2018-145629730221
Information
  • Publisher :The Acoustical Society of Korea
  • Publisher(Ko) :한국음향학회
  • Journal Title :The Journal of the Acoustical Society of Korea
  • Journal Title(Ko) :한국음향학회지
  • Volume : 41
  • No :2
  • Pages :122-129
  • Received Date : 2021-09-28
  • Revised Date : 2021-12-20
  • Accepted Date : 2021-12-30