[Report] A Study on the Development of Human Interface Technology

Kim Sang-ryong (김상룡)

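Report Information
Lead research institution	Samsung Advanced Institute of Technology, Samsung Electronics Co., Ltd. (삼성전자(주) 종합기술원)
Principal investigator	Kim Sang-ryong (김상룡)
Report type	Phase 2 report
Country of publication	Republic of Korea
Language	Korean
Publication date	2002-12
Supervising ministry	Ministry of Science and Technology
Program management agency	Korea Science and Engineering Foundation (한국과학재단)
Registration number	TRKO200800067238
Program name	Specific R&D Program < Key National R&D Program
DB build date	2013-04-18
Keywords	speech recognition (listed in English as Speech Understanding), speech synthesis, speech coding, face recognition, animation, HCI processor chip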

Abstract (translated from the Korean)

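1. Spoken language understanding technology
(1) Continuous speech recognition: 60,000 words; word recognition rate: 93.4% (speaker-independent), 94.9% (speaker-adapted)
(2) Conversational speech understanding: 3,000 intentions; dialogue success rate: 91%
(3) Speaker identification: text-independent (non-fixed sentence) speaker identification; 100 speakers; EER (Equal Error Rate): 2.19%
(4) Speech recognizer for the HCI Processor: 10,000 words; continuous speech recognition; recognition rate: 95%
2. Speech synthesis and signal processing
(1) Conversational-style speech synthesizer for robots: 500 MB; MOS score: 4.1
(2) Speech synthesizer for the HCI Processor: 10 MB; MOS score: 3.5
(3) Narrowband/wideband speech codec: at 4 kbit/s, equivalent to G.723.1 at 5.3 kbit/s; at 16 kbit/s, equivalent to G.722.1 at 48 kbit/s
(4) Automatic labeler: 95.17% accuracy within a tolerance of ±20 ms
3. Human recognition and synthesis
(1) Face detection and recognition technology
. Face detection: 4 frames per second; detection rate: 98%
. Face recognition: large scale: 500 people [95%]; standardization: ANMRR [0.270], 240 bits (descriptor size)
. 3D human face animation: 24 muscles, 44 action units, 10 mouth shapes
4. HCI processor chip
(1) Multimodal support
. Speech recognition (10,000 words), speech synthesis (10 MB), speech codec (4k/16k), face recognition (10 people)
. Hardware implementation of a 2.2-million-gate-class processor
(2) Hardware engines inside the HCI chip (FFT, HMM, image convolution)
(3) Dual-processor architecture using an ARM920T MCU core and a TeakLite DSP core
(4) Single chip (up to 792 MIPS class), low power (300 mW), process (0.25 µm)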
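
The speaker identification result in item 1(3) is reported as an equal error rate (EER), the operating point at which the false acceptance rate equals the false rejection rate. The record does not say how the report computed it, so the following is only a minimal sketch under stated assumptions: higher scores mean "more likely the claimed speaker", a trial is accepted when its score is at or above the threshold, and the function name `equal_error_rate` and the sample scores are hypothetical.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER from two lists of match scores.

    genuine:  scores of trials where the claimed speaker is correct
    impostor: scores of trials where the claimed speaker is wrong
    Assumption: a trial is accepted when score >= threshold.
    Returns (eer, threshold) at the threshold where the false
    acceptance and false rejection rates are closest.
    """
    if not genuine or not impostor:
        raise ValueError("both score lists must be non-empty")

    best = None  # (gap, eer, threshold)
    for t in sorted(set(genuine) | set(impostor)):
        far = sum(s >= t for s in impostor) / len(impostor)  # false accepts
        frr = sum(s < t for s in genuine) / len(genuine)     # false rejects
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Hypothetical scores, for illustration only (not data from the report).
genuine_scores = [0.9, 0.8, 0.7, 0.6]
impostor_scores = [0.65, 0.4, 0.3, 0.2]
eer, threshold = equal_error_rate(genuine_scores, impostor_scores)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With only a handful of trials the two error curves rarely cross exactly, which is why the sketch averages the two rates at the closest threshold rather than interpolating.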

Abstract (English original)

** 1st Year
O Specifications
o Baseline construction and problem definition for each core technology
O Details
o Baseline construction of large vocabulary continuous speech recognition system
o Development of speaker verification algorithm
o Preliminary study on spoken language understanding
o Implementation of narrow band speech codec (4 kbit/s)
o Preliminary study on wide band speech codec
o Development of face detection and verification system in office environment
o Generation of 3-D standard face model
o Multi-processor design methodology setup
** 2nd Year
O Specifications
o Baseline construction of preliminary system and core algorithms for processing speech, face images, and emotion
O Details
o Development of 10K-word LVCSR
o Development of speaker verification
o Designing prototype of spoken language understanding
o International standardization of narrow band speech codec (ITU-T)
o Building wide band speech codec (16~32 kbit/s)
o Emotion analysis and feature extraction
o 3D human face modeling ~ facial expression
o Fixed-point modeling
** 3rd Year
O Specifications
o Spoken language understanding
O Details
o 40K-word LVCSR (speed: 3 sec, accuracy: 95%)
o Designing spoken language understanding algorithm (vocabulary: 5,000 words)
o Fixed-sentence speaker identification (100 persons, EER 1%)
o Implementation of speech synthesis module for generation of dialog-type prosody
o Implementation of narrow band variable bit rate speech codec (1~8 kbit/s)
o Implementation of high quality wide band speech codec (16 kbit/s)
o Development of large-scale face recognition (500 persons, 95%)
o Development of animation for facial expression
o Microarchitecture development and verification
** 4th Year
O Specifications
o HCI Processor
O Details
o 60K-word LVCSR (speed: 3 sec, accuracy: 95%)
o Spoken language understanding (speed: 2 sec, accuracy: 90%)
o Text-free speaker verification (100 persons, EER 1%)
o Speech synthesis with self-learning module
o Optimization of narrow band speech codec
o Optimization of wide band speech codec
o Development of face emotion detection
o Development of human animation system
o Implementation of visual user interface
o Building data acquisition and basic library ASIC
o Microarchitecture setup and design/development/verification
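
The LVCSR targets above are stated as accuracy percentages, but the record does not define the metric. One common convention, assumed here and not confirmed by the source, is word accuracy = 1 - (substitutions + deletions + insertions) / (reference word count), with the error count taken from a minimum edit distance alignment. The sketch below illustrates that convention only; the function name `word_accuracy` and the example sentences are hypothetical.

```python
def word_accuracy(reference, hypothesis):
    """Word accuracy of a recognized sentence against a reference.

    Assumed definition (not taken from the report):
        accuracy = 1 - (S + D + I) / N
    where S, D, I are substitutions, deletions, and insertions in the
    minimum edit distance alignment and N is the reference word count.
    Words are split on whitespace.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr

    errors = prev[-1]
    return 1.0 - errors / len(ref)


# Hypothetical example: one deleted word and one substituted word.
ref_text = "turn on the living room light"
hyp_text = "turn on living room lights"
print(f"word accuracy = {word_accuracy(ref_text, hyp_text):.1%}")
```

Under this definition the value can go negative when the recognizer inserts many extra words, so it is not interchangeable with a simple percent-correct figure.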

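Contents

Chapter 1. Overview of the R&D Project
Section 1. Purpose of the R&D
Section 2. Necessity of the R&D
Section 3. Scope of the R&D
Chapter 2. Status of Domestic and International Technology Development
Section 1. Status of technology development
Section 2. Position of this project
Chapter 3. R&D Activities and Results
Section 1. Spoken language understanding technology
Section 2. Speech synthesizer and signal processing
Section 3. Human recognition and synthesis
Section 4. HCI processor chip
Chapter 4. Achievement of Objectives and Contribution to Related Fields
Section 1. Achievement of the R&D objectives
Section 2. Contribution to technological progress
Chapter 5. Plans for Utilizing the R&D Results
Section 1. Need for further research
Section 2. Application to other research
Section 3. Direction for commercialization
Chapter 6. Overseas Science and Technology Information Collected During the R&D
Chapter 7. References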
