
2020 Vol. 39, Issue 5

Research Article

September 2020. pp. 441-446
References
1
D. Snyder, D. Garcia-Romero, G. Sell, D. Povey, and S. Khudanpur, "X-vectors: Robust DNN embeddings for speaker recognition," Proc. ICASSP, 5329-5333 (2018).
10.1109/ICASSP.2018.8461375
2
J. Jung, H. Heo, Y. Yang, H. Shim, and H. Yu, "A complete end-to-end speaker verification system using deep neural networks: from raw signals to verification result," Proc. ICASSP, 5349-5353 (2018).
10.1109/ICASSP.2018.8462575
3
J. Jung, H. Heo, H. Shim, and H. Yu, "Short utterance compensation in speaker verification via cosine-based teacher-student learning of speaker embeddings," Proc. IEEE ASRU, 335-341 (2019).
10.1109/ASRU46091.2019.9004029
4
H. Muckenhirn, M. Magimai-Doss, and S. Marcel, "Towards directly modeling raw speech signal for speaker verification using CNNs," Proc. ICASSP, 4884-4888 (2018).
10.1109/ICASSP.2018.8462165
5
J. Jung, H. Heo, J. Kim, H. Shim, and H. Yu, "RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification," Proc. Interspeech, 1268-1272 (2019).
10.21437/Interspeech.2019-1982
6
J. Jung, S. Kim, H. Shim, J. Kim, and H. Yu, "Improved RawNet with filter-wise rescaling for text-independent speaker verification using raw waveforms," arXiv preprint arXiv:2004.00526 (2020).
7
K. He, X. Zhang, S. Ren, and J. Sun, "Identity mappings in deep residual networks," Proc. ECCV, 630-645 (2016).
8
J. Hu, L. Shen, S. Albanie, G. Sun, and E. Wu, "Squeeze-and-excitation networks," Proc. IEEE CVPR, 7132-7141 (2018).
10.1109/CVPR.2018.00745
9
J. Zhang, N. Inoue, and K. Shinoda, "I-vector transformation using conditional generative adversarial networks for short utterance speaker verification," Proc. Interspeech, 3613-3617 (2018).
10.21437/Interspeech.2018-1680
10
J. Chung, A. Nagrani, and A. Zisserman, "VoxCeleb2: deep speaker recognition," Proc. Interspeech, 1086-1090 (2018).
10.21437/Interspeech.2018-1929
11
A. Nagrani, J. Chung, and A. Zisserman, "VoxCeleb: a large-scale speaker identification dataset," Proc. Interspeech, 2616-2620 (2017).
10.21437/Interspeech.2017-950
12
M. Ravanelli and Y. Bengio, "Speaker recognition from raw waveform with SincNet," Proc. IEEE SLT, 1021-1028 (2018).
10.1109/SLT.2018.8639585
Information
  • Publisher: The Acoustical Society of Korea
  • Publisher (Ko): 한국음향학회
  • Journal Title: The Journal of the Acoustical Society of Korea
  • Journal Title (Ko): 한국음향학회지
  • Volume: 39
  • No: 5
  • Pages: 441-446
  • Received Date: 2020. 07. 18
  • Accepted Date: 2020. 09. 08