최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기IEEE access : practical research, open solutions, v.8, 2020년, pp.175448 - 175466
Jung, Youngmoon (Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Daejeon, South Korea) , Choi, Yeunju (Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Daejeon, South Korea) , Lim, Hyungjun (Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Daejeon, South Korea) , Kim, Hoirin (Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Daejeon, South Korea)
Speaker verification (SV) has recently attracted considerable research interest due to the growing popularity of virtual assistants. At the same time, there is an increasing requirement for an SV system: it should be robust to short speech segments, especially in noisy and reverberant environments. ...
Proc IEEE Workshop Autom Speech Recog and Understanding The Kaldi speech recognition toolkit povey 2011
Jongseo Sohn, Nam Soo Kim, Wonyong Sung. A statistical model-based voice activity detection. IEEE signal processing letters, vol.6, no.1, 1-3.
Varga, A., Steeneken, H.J.M.. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech communication, vol.12, no.3, 247-251.
IEEE Transactions on Audio Speech and Language Processing A tandem algorithm for pitch estimation and voiced speech segregation hu 2010 10.1109/TASL.2010.2041110 18 2067
J Mach Learn Res Visualizing data using t-SNE van der maaten 2008 9 2579
Proc Adv Neural Inf Process Syst Autodiff Workshop Automatic differentiation in pytorch paszke 2017
Wang, Feng, Cheng, Jian, Liu, Weiyang, Liu, Haijun. Additive Margin Softmax for Face Verification. IEEE signal processing letters, vol.25, no.7, 926-930.
Proc 33rd Int Conf Mach Learn Large-margin softmax loss for convolutional neural networks liu 2016 507
Buda, Mateusz, Maki, Atsuto, Mazurowski, Maciej A.. A systematic study of the class imbalance problem in convolutional neural networks. Neural networks : the official journal of the International Neural Network Society, vol.106, 249-259.
Ghosh, Prasanta Kumar, Tsiartas, Andreas, Narayanan, Shrikanth. Robust Voice Activity Detection Using Long-Term Signal Variability. IEEE transactions on audio, speech, and language processing, vol.19, no.3, 600-613.
arXiv 2003 12266 Dual attention in time and frequency domain for voice activity detection lee 2020
arXiv 2005 03867 Multi-task network for noise-robust keyword spotting and speaker verification using CTC-based soft VAD and global query attention jung 2020
arXiv 1510 08484 [cs] MUSAN: A music, speech, and noise corpus snyder 2015
Aurora working group: DSR front end LVCSR evaluation AU/384/02 pearce 2002
Dehak, Najim, Kenny, Patrick J, Dehak, Réda, Dumouchel, Pierre, Ouellet, Pierre. Front-End Factor Analysis for Speaker Verification. IEEE transactions on audio, speech, and language processing, vol.19, no.4, 788-798.
Hansen, John H. L., Hasan, Taufiq. Speaker Recognition by Machines and Humans: A tutorial review. IEEE signal processing magazine, vol.32, no.6, 74-99.
Wang, Shuai, Huang, Zili, Qian, Yanmin, Yu, Kai. Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE/ACM transactions on audio, speech, and language processing, vol.27, no.11, 1686-1696.
arXiv 2004 03194 Improving multi-scale aggregation using feature pyramid module for robust speaker verification of variable-duration utterances jung 2020
arXiv 2004 02863 Meta-learning for short utterance speaker recognition with imbalance length pairs min kye 2020
Al-Ali, Ahmed Kamil Hasan, Dean, David, Senadji, Bouchra, Chandran, Vinod, Naik, Ganesh R.. Enhanced Forensic Speaker Verification Using a Combination of DWT and MFCC Feature Warping in the Presence of Noise and Reverberation Conditions. IEEE access : practical research, open solutions, vol.5, 15400-15413.
Proc Int Conf Med Image Comput -Assist Intervent U-net: Convolutional networks for biomedical image segmentation ronneberger 2015 234
Xiao-Lei Zhang, DeLiang Wang. Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection. IEEE/ACM transactions on audio, speech, and language processing, vol.24, no.2, 252-264.
Proc INTERSPEECH Comparison of forced-alignment speech recognition and humans for generating reference VAD kraljevski 2015 2937
Proc Conf Neural Inf Process Syst Residual networks behave like ensembles of relatively shallow networks veit 2016 550
Zhang, Chunlei, Koishida, Kazuhito, Hansen, John H. L.. Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings. IEEE/ACM transactions on audio, speech, and language processing, vol.26, no.9, 1633-1644.
Proc INTERSPEECH A time delay neural network architecture for efficient modeling of long temporal contexts peddinti 2015 3214
Proc Int Conf Learn Represent Very deep convolutional networks for large-scale image recognition simonyan 2015
Proc Odyssey Bayesian speaker verification with heavy-tailed priors kenny 2010 14
Ioffe, S.. Probabilistic Linear Discriminant Analysis. Lecture notes in computer science, vol.3954, 531-542.
Zhang, Xingyu, Zou, Xia, Sun, Meng, Zheng, Thomas Fang, Jia, Chong, Wang, Yimin. Noise Robust Speaker Recognition Based on Adaptive Frame Weighting in GMM for i-Vector Extraction. IEEE access : practical research, open solutions, vol.7, 27874-27882.
He, K., Zhang, X., Ren, S., Sun, J.. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. Lecture notes in computer science, vol.8691, 346-361.
Campbell, W.M., Sturim, D.E., Reynolds, D.A.. Support vector machines using GMM supervectors for speaker verification. IEEE signal processing letters, vol.13, no.5, 308-311.
해당 논문의 주제분야에서 활용도가 높은 상위 5개 콘텐츠를 보여줍니다.
더보기 버튼을 클릭하시면 더 많은 관련자료를 살펴볼 수 있습니다.
*원문 PDF 파일 및 링크정보가 존재하지 않을 경우 KISTI DDS 시스템에서 제공하는 원문복사서비스를 사용할 수 있습니다.
오픈액세스 학술지에 출판된 논문
※ AI-Helper는 부적절한 답변을 할 수 있습니다.