방송공학회논문지 = Journal of Broadcast Engineering, v.24 no.5, 2019, pp.735-742
한석현, 김재원, 안순호, 신성현, 박호종 (all Department of Electronic Engineering, Kwangwoon University)
In this paper, we propose a method of extracting speech features for phoneme recognition based on spikegram. The Fourier-transform-based features are widely used in phoneme recognition, but they are not extracted in a biologically plausible way and cannot have high temporal resolution due to the fra...