최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기한국전자통신학회 논문지 = The Journal of the Korea Institute of Electronic Communication Sciences, v.10 no.9, 2015년, pp.993 - 1000
Mel-Frequency Cepstral Coefficients(: MFCC) is one of the noble feature vectors for speech signal processing. An evident drawback in MFCC is that the phase information is lost by taking the magnitude of the Fourier transform. In this paper, we consider a method of utilizing the phase information by ...
* AI 자동 식별 결과로 적합하지 않은 문장이 있을 수 있으니, 이용에 유의하시기 바랍니다.
G. Kaplan, "Words into action: I," IEEE Spectrum, vol. 17, 1980, pp. 22-26.
Y. Chang, S. Hung, N. Wang, and B. Lin, "CSR: A Cloud-assisted speech recognition service for personal mobile device," Int. Conf. on Parallel Processing, Taipei, Taiwan, Sep. 2011, pp. 305-314.
M. Kang, "A Study on the Design of Multimedia Service Platform on Wireless Intelligent Technology," J. of the Korea Institute of Electronic Communication Sciences, vol. 4, no. 1, 2009, pp. 24-30.
J. Yoo, H. Park, H. Shin, and Y. Shin, "A Study of the Communication Infrastructure Construction for u-City in Korea," J. of the Korea Institute of Electronic Communication Sciences, vol. 1, no. 2, 2006, pp. 127-135.
B. Kim, "Service Quality Criteria for Voice Services over a WiBro Network," J. of the Korea Institute of Electronic Communication Sciences, vol. 6, no. 6, 2011, pp. 823-829.
J. W. Picone, "Signal modeling techniques in speech recognition," Proc. IEEE, vol. 81, no. 9, 1993, pp. 1215-1247.
B. Bozkurt and L. Couvreur, "On the use of phase information for speech recognition," In Proc. of Eusipco, Antalya, Turkey, 2005, pp. 1-4.
K. K. Paliwal, "Usefulness of phase in speech processing", Proc. IPSJ Spoken Language Processing Workshop, Gifu, Japan, Feb. 2003, pp. 1-6.
J. C. Wang, J. F. Wang, and Y. Weng, "Chip design of MFCC extraction for speech recognition," The VLSI Journal, vol. 32, 2002, pp. 111-131.
J. M. Bioucas-Dias and G. Valadao, "Phase Unwrapping via Graph Cuts," IEEE Trans. on Image Processing, vol. 16 no. 3, 2007, pp. 698-709.
T. Drugman, B. Bozkurt, and T. Dutoit, "Complex Cepstrum-Based Decomposition of Speech for Glottal Source Estimation," Interspeech, Brighton, Sep. 2009, pp. 116-119.
L. Fausett, Fundamentals of Neural Networks, New Jersey: Prentice-Hall, 1994.
J. R. Deller, J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals, New York: Macmillan, 1994.
W. Xu, Zhengzhou, Y. Guo, B. Wang and X. Wang, "A Noise Robust Front-End Using Wiener Filter, Probability Model and CMS for ASR," Int. Conf. on Natural Language Processing and Knowledge Engineering, Zhengzhou, China, 2005, pp. 102-105.
M. Dehghan, K. Faez, M. Ahmadi, and M. Shridhar, "Unconstrained Farsi Handwritten Word Recognition Using Fuzzy Vector Quantization and Hidden Markov models," Pattern Recognition Letters, vol. 22, 2001, pp. 209-214.
*원문 PDF 파일 및 링크정보가 존재하지 않을 경우 KISTI DDS 시스템에서 제공하는 원문복사서비스를 사용할 수 있습니다.
Free Access. 출판사/학술단체 등이 허락한 무료 공개 사이트를 통해 자유로운 이용이 가능한 논문
※ AI-Helper는 부적절한 답변을 할 수 있습니다.