Research Article
- Publisher: The Acoustical Society of Korea
- Publisher (Ko): 한국음향학회
- Journal Title: The Journal of the Acoustical Society of Korea
- Journal Title (Ko): 한국음향학회지
- Volume: 44
- No: 2
- Pages: 85-93
- Received Date: 2024-11-20
- Revised Date: 2025-01-03
- Accepted Date: 2025-02-17
- DOI: https://doi.org/10.7776/ASK.2025.44.2.085