[Thesis] Digital image/video format conversion: the complete compressed domain approaches
(디지털 정지영상 및 동영상 포맷 변환 : 완전 압축영역 접근방식)

김도년 (Graduate School, Yonsei University, Dept. of Electrical and Electronic Engineering; domestic doctoral thesis)

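Abstract (translated from Korean)

Compressed-domain processing of digital still images and video has advanced rapidly over the past few years. Because images are generally stored in compressed form using transform coding, processing them directly in the compressed domain is more effective than processing images reconstructed in the spatial domain. Video editing and special effects require real-time processing of digital video. Such processing includes downsampling, brightness and contrast adjustment, translation of image segments, filtering, masking, rotation, and motion compensation. In this work, image resizing is computed in the compressed domain with fast DCT algorithms, which substantially reduces the amount of computation. Two methods are proposed to speed up downsampling considerably. The first downsamples using bilinear interpolation; the second downsamples by truncating the DCT to its low-frequency coefficients and computing an approximation. The two fast algorithms give similar image quality, and the second requires 13% less computation than the first. A fast algorithm is also proposed for upsampling based on DCT low-frequency truncation, which reduces computation by 32% compared with the existing method. In addition, the newly derived Winograd DCT algorithms were found to reduce computation dramatically when manipulating compressed-domain images and video. Compressed-domain upsampling and downsampling were carried out with the JPEG still-image compression standard, and the image quality of the proposed methods was compared with that of earlier methods.

Transcoding from DV to MPEG-2 is one example of the format conversion proposed above, so the proposed downsampling and upsampling methods can be applied to it. This thesis presents an efficient DV-to-MPEG-2 transcoding scheme. It describes the macroblock rearrangement required for the conversion and the de-shuffling of data required when decoding a DV signal. To reduce complexity by eliminating intermediate processing steps, DV-to-MPEG-2 transcoding needs to be performed in the DCT domain. When converting the 2-4-8 DCT mode of DV to the 8-8 DCT mode widely used in general compression encoders, applying the proposed fast algorithms reduces the computational complexity. To implement the rate control algorithm described in MPEG-2 TM-5 entirely in the DCT domain, the variances of the sub-blocks are computed in the DCT domain. When transcoding DV to MPEG-2 inter coding, motion is estimated by applying the IDCT to only part of the DCT coefficients. Partially overlapping the search ranges gave a higher PSNR than not overlapping them. Compared with full search, this approach greatly reduced the time needed for motion estimation, while the peak signal-to-noise ratio decreased by about . dB. The Winograd DCTs derived in this thesis for input sequences of length 4 or 6 are expected to reduce computation substantially when used in other areas of image processing.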
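The low-frequency truncation idea behind the second downsampling method can be illustrated with a minimal sketch. It is not the fast Winograd-based algorithm of the thesis: it uses plain matrix products, assumes an orthonormal DCT-II, and assumes a scale factor of 1/2 so that a flat 8x8 block maps to a flat 4x4 block of the same value. All function names are hypothetical.

```python
import math

def dct_matrix(n):
    """Orthonormal n-point DCT-II matrix (assumed normalization)."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def dct2(block):
    """2-D DCT of a square block: C * block * C^T."""
    c = dct_matrix(len(block))
    return matmul(matmul(c, block), transpose(c))

def idct2(coeffs):
    """2-D inverse DCT of a square block: C^T * coeffs * C."""
    c = dct_matrix(len(coeffs))
    return matmul(matmul(transpose(c), coeffs), c)

def downsample_block_by_truncation(coeffs8):
    """Keep the 4x4 low-frequency corner of an 8x8 DCT block.

    The division by 2 compensates for the change of block size under
    the orthonormal normalization assumed here (DC = 8*mean for 8x8,
    DC = 4*mean for 4x4).
    """
    return [[coeffs8[u][v] / 2.0 for v in range(4)] for u in range(4)]
```

In this sketch the 4x4 result is still a block of DCT coefficients; `idct2` is only needed to inspect the pixels, for example `idct2(downsample_block_by_truncation(dct2(block)))` for an 8x8 pixel block.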

Abstract

Compressed-domain processing of digital images and video streams has been a problem area of rapidly increasing interest in the last few years. Since images are usually stored in the transform domain as compressed data, it is more efficient to operate on images directly in the compressed domain than in the spatial domain. Certain applications require real-time manipulation of digital video in order to implement image composition and special effects, e.g., downsampling, modifying contrast and brightness, translation, filtering, masking, rotation, inverse motion compensation, etc. In this thesis, a significant reduction in complexity is shown for compressed-domain image resizing by using fast DCTs. Downsampling schemes that use bilinear interpolation and DCT lowpass truncated approximation, respectively, have been presented recently in the literature. Fast algorithms for both are proposed in this thesis. Both fast algorithms give very similar image quality, while the latter has about 13% less computational complexity. The fast algorithm for an upsampling scheme using DCT lowpass truncated approximation reduces the number of basic arithmetic operations by about 32% relative to an earlier method. The newly derived Winograd DCT algorithms can be exploited in compressed-domain image/video manipulation. Transcoding Digital Video (DV) to MPEG-2 is one example of format conversion, and the proposed fast downsampling and upsampling algorithms can be applied to it. An efficient method for transcoding DV to MPEG-2 intra coding is presented. Rearrangement of the macroblock pixels for the conversion and de-shuffling in the DV coding are presented. Transcoding DV to MPEG-2 intra coding is performed in the DCT domain to reduce conversion steps. The proposed fast algorithms are applied to convert the 4:1:1 format to the 4:2:2 format and the 2-4-8 DCT mode to the 8-8 DCT mode. This conversion yields a significant improvement in computational complexity compared to the simple approach. The variances of sub-blocks for mquant in the MPEG-2 Test Model 5 (TM-5) rate control algorithm are computed in the DCT domain. Fast motion estimation that uses only part of the DCT coefficients is studied for transcoding into MPEG-2 inter coding. Motion estimation with overlapped search ranges shows better PSNR performance than motion estimation without overlapping. This approach reduces computational complexity significantly, whereas its PSNR is about 1 dB lower than that of the full search method. The approaches proposed here for image resizing can be expected to be useful for developing fast algorithms for other linear operations on digital video to which the four-point or six-point Winograd DCTs can be applied.
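Computing a sub-block variance without leaving the DCT domain follows from Parseval's relation: for an orthonormal DCT, the DC coefficient carries the block mean and the remaining (AC) energy equals the sum of squared deviations from that mean. The sketch below assumes an orthonormal 2-D DCT-II and the population variance (division by the number of pixels); the function names are hypothetical, and the thesis's own derivation may differ in normalization.

```python
import math

def dct2(block):
    """Direct (slow) orthonormal 2-D DCT-II, used here only to make test data."""
    n = len(block)

    def basis(k, i):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        return scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))

    return [[sum(basis(u, i) * basis(v, j) * block[i][j]
                 for i in range(n) for j in range(n))
             for v in range(n)] for u in range(n)]

def variance_from_dct(coeffs):
    """Population variance of an n x n block from its DCT coefficients.

    variance = (sum of squared AC coefficients) / n^2
    """
    n = len(coeffs)
    ac_energy = sum(coeffs[u][v] ** 2
                    for u in range(n) for v in range(n)
                    if (u, v) != (0, 0))
    return ac_energy / (n * n)

def variance_spatial(block):
    """Reference population variance computed from the pixels."""
    pixels = [p for row in block for p in row]
    mean = sum(pixels) / len(pixels)
    return sum((p - mean) ** 2 for p in pixels) / len(pixels)
```

For any 8x8 pixel block, `variance_from_dct(dct2(block))` and `variance_spatial(block)` should agree up to floating-point rounding, which is why the variance needed for rate control can be obtained from coefficients that are already available.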

Thesis information

Author	김도년
Degree-granting institution	Graduate School, Yonsei University
Degree type	Doctoral (domestic)
Department	Dept. of Electrical and Electronic Engineering
Advisor	Yoonsik Choe
Year of publication	2004
Total pages	xiii, 87 leaves
Keywords	compressed domain processing; discrete cosine transform; DV; JPEG; MPEG; motion estimation; filtering; downsampling; upsampling; convolution
Language	eng
Full-text URL	http://www.riss.kr/link?id=T9239401&outLink=K
Source	한국교육학술정보원 (Korea Education and Research Information Service, KERIS)
