[논문]2D 이미지에서 3D 조형물 인식을 위한 딥러닝 네트워크 구조설계

김예진

2D 이미지에서 3D 조형물 인식을 위한 딥러닝 네트워크 구조설계 원문보기

김예진 (상명대학교 일반대학원, 저작권보호학과, 국내박사)

초록 ▼
AI-Helper

최근 다양한 분야에서 딥러닝을 비롯한 인공지능 기술의 활용되고 있다. 특히 컴퓨터 비전 분야에서는 객체의 인식과 검출을 목적으로 한 딥러닝 기술들이 뛰어난 성능을 보이며 연구 또한 활발히 진행되고 있다. 딥러닝 기술을 활용함으로써 기존의 방법론으로 해결이 어려운 문제들을 쉽게 해결하는 성과를 거두고 있다.
많은 객체 인식 ...

최근 다양한 분야에서 딥러닝을 비롯한 인공지능 기술의 활용되고 있다. 특히 컴퓨터 비전 분야에서는 객체의 인식과 검출을 목적으로 한 딥러닝 기술들이 뛰어난 성능을 보이며 연구 또한 활발히 진행되고 있다. 딥러닝 기술을 활용함으로써 기존의 방법론으로 해결이 어려운 문제들을 쉽게 해결하는 성과를 거두고 있다.
많은 객체 인식 알고리즘은 객체의 가장 특징적인 단면적 면을 또는 한 부분을 메인 특징라고 여기고 특징을 추출 분석 비교 하여 인식하는 것이 통상적이다. 그러나 대부분 물체는 3D형태로 되어 있다. 물체의 거리, 위치, 앵글 각도에 따라 2D 이미지에 노출되는 형태가 다르다. 또한 객체를 대표할 수 있는 특징이 한 면에만 꼭 있는 것이 아니다. 현재 많은 특징 추출 및 객체 인식 컴퓨터 비전 알고리즘을 연구가 되고 있지만, 대부분 제안된 객체 인식 알고리즘들은 그 객체의 특징이라 할 수 있는 정면부분이 노출될 때의 모델들을 데이터 셋으로 사용하고 그에 대한 인식결과 또한 정면이나 특징적인 부분을 가지고 계산한다. 3D 객체를 인식하는 알고리즘은 객체의 깊이 정보 등의 부가적인 정보가 추가되어 설계된 알고리즘이 대부분이다.
본 논문에서는 객체의 전반 특징을 다루고 3D 객체를 인식하기 위하여 조각상과 3D 캐릭터 두 가지 유형 모델을 사용하여 2D이미지 속 3D 객체를 검출할 수 있는 딥러닝 네트워크를 설계한다. 객체의 추출 시간의 단축을 위하여, 객체의 중심점(key point)과 boundary box를 예측하여 하나의 객체 당 하나의 앵커박스를 사용하여 객체를 추출한다.
설계된 네트워크 중 필터를 설계함에 있어 기존에 있던 큰 필터 대신 작은 필터를 중첩 사용하여 연산 량은 줄이고 가중치가 작아지고 시각화시 레이어가 많아져 특징이 더 고차원적이 됨으로 인식률을 높여준다. 트레이닝할 때 드롭아웃 (drop-out)시킴으로써 오버피팅(over-fitting) 을 감소시켰다 기존의 시그모이드 (sigmoid)함수를 유닛수정함수(rectified unit function)로 대체함으로써 검사소멸효과 (gradient-vanishing effect)를 줄였다.
3D 객체 중 조형물은 87.9%, 3D 캐릭터 모델은 90.2% 인식률에 도달 하여 기존에 연구되었던 결과와 비교 하였을 때 인식률이 5-15% 증가하였으며 실시간 검출이 가능하다.

Abstract ▼ AI-Helper

Recently, the use of artificial intelligence, including deep learning, has risen across various fields. Most notably, in the study of computer vision techniques, new algorithms based on deep learning have shown excellent object recognition and detection, and problems that could not be solved by former methodologies are solved using deep learning technology.
Many object recognition algorithms identify the most characteristic cross-sectional area or part of an object as its main feature; it then extracts and compares features to distinguish them. However, most objects are three-dimensional, and depending on the distance, position, and angle of the object, the shape exposed on the two-dimensional image capture varies. Also, there is not only one feature that can represent an object. Currently, many feature extraction and object recognition computer vision algorithms have been studied, but most of the proposed object recognition algorithms use the front view of a model, which they take as the most salient feature of the object. This is then used as the dataset, and the recognition results are calculated with the frontal or characteristic parts. Algorithms recognizing 3D objects are mostly designed by adding supplementary information such as the depth information of an object.
In this paper, a deep learning network was designed that can detect 3D objects in 2D images by using two types of models of statues and 3D characters to expand the algorithms awareness of the general features of objects, allowing it to recognize 3D objects. In order to shorten the extraction time of the object’s features, an object is extracted using one anchor box per object, and by predicting the key point and boundary box of the object.
In designing the filter of the proposed neural network, the small filter is superimposed instead of the preexisting large filter, reducing the computational complexity and the weight, as well as increase the number of layers in the visualization. Also, drop-out during training reduces some overfitting. By replacing the existing sigmoid function with a rectified unit function, the vanishing gradient effect is reduced.
The recognition rate was 87.9% for 3D objects and 90.2% for 3D character models, and that rate was found to be 5-15% higher when compared with previous studies. It is also noted that

주제어

학위논문 정보

저자	김예진
학위수여기관	상명대학교 일반대학원,
학위구분	국내박사
학과	저작권보호학과,
지도교수	김종원
발행연도	2020
총페이지	95 p.:
키워드	딥러닝
언어	eng
원문 URL	http://www.riss.kr/link?id=T15487054&outLink=K
정보원	한국교육학술정보원

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명(한글), 저자명(한글), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문) 관리번호, 논문명(한글), 논문명(영문), 저자명(한글), 저자명(영문), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문)
저장형식	Text(ASCII format) Excel format
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

2D 이미지에서 3D 조형물 인식을 위한 딥러닝 네트워크 구조설계 원문보기

초록 ▼
AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

2D 이미지에서 3D 조형물 인식을 위한 딥러닝 네트워크 구조설계 원문보기

초록 ▼ 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

초록 ▼
AI-Helper