In forensic applications, speaker verification consists of evaluating whether a suspect's voice matches the voice in an evidence audio recording. In this project, we propose a machine-learning solution for speaker verification of audio recordings with fake intonation. The system takes indirect characteristics of the audio recordings as input, and the classifier is a neural network whose hyperparameters are tuned using cross-validation. The performance results are: overall accuracy (OA) of 88.2%, precision (P) of 84.5%, recall (R) of 90%, F1 score of 87.2%, and area under the curve (AUC) of 93.8%.
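As a quick consistency check on the reported metrics, the F1 score can be recomputed from the stated precision and recall, since F1 is their harmonic mean. The sketch below is illustrative only: the function and variable names are assumptions, not taken from the project, and it uses only the two percentages quoted above.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # convention: F1 is 0 when both precision and recall are 0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract, converted from percentages to fractions.
reported_precision = 0.845
reported_recall = 0.90

f1 = f1_score(reported_precision, reported_recall)
print(f"F1 = {100 * f1:.1f}%")  # prints "F1 = 87.2%"
```

The recomputed value agrees with the reported F1 of 87.2%.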