[논문]NPU Simulator 설계 및 Convolution 연산 Acceleration 평가

권구윤

[학위논문] NPU Simulator 설계 및 Convolution 연산 Acceleration 평가
Design of NPU Simulator and Evaluation of Convolution Acceleration 원문보기

권구윤 (고려대학교 대학원 반도체시스템공학과 국내석사)

초록 ▼
AI-Helper

인공신경망 기술이 자율주행 자동차, 인공지능 기반 CCTV와 같은 에지 디바이스에 적용되면서 임베디드 환경을 위한 NPU가 주목받고 있다. 그러나 대부분의 NPU에 관한 연구는 기업을 중심으로 이루어져 NPU Microarchitecture에 접근하고 활용하기 어렵다. 이로 인해, 접근이 용이한 NPU Microarchitecture 와 NPU에 관한 새로운 전략을 실험할 환경의 조성이 필요하다. Cycle-accurate Simulator는 하드웨어의 동작을 사이클 단위로 시뮬레이션 가능한 프로그램으로, Microarchitecture의 변경과 디버깅이 쉽다는 특성이 있다. 본 논문에서는 임베디드 환경을 위한NPU Microarchitecture를 설계하고 이를 Cycle-Accurate Simulator로 구현하여 NPU 실험을 위한 환경을 조성하였다. Convolution Layer의 추론 연산을 시행하는 Convolution Instruction Stream을 설계하고 VGGNet-16과 MobileNet-v1의 Convolution Layer 추론을 통해 정상 동작을 검증하였다. 조성한 환경에서 CNN Acceleration 기법의 적용 및 성능 평가가 가능함을 보이기 위해 IFM Data Reuse 전략을 적용하고 이에 따른 성능 차이를 측정하였다. 이를 위해 IFM Data Reuse 전략을 구현하기 위한 NPU 명령어를 추가하고 Convolution Instruction Stream을 수정하였다. VGGNet-16과 MobileNet-v1 Convolution Layer 추론 처리 결과 IFM Data Reuse 전략을 적용함에 따라 Convolution Layer 추론 속도가 최대 1.326배 증가하였다. IFM을 불러오기 위한 메모리 접근 횟수는 최대 2.8배 감소하였다. 이를 통해 임베디드 시스템을 위한 NPU Microarchitecture를 확보하고 CNN Acceleration 전략을 시험할 수 있는 환경을 구축하였다.

Abstract ▼ AI-Helper

NPU for embedded environments is attaching attention as artificial neural network technology is applied to edge devices such as self-driving cars and AI-based CCTV. However, it is difficult to access NPU microarchitecture because most of the research about NPU is done by big companies. For this reason, it is necessary to design a new NPU microarchitecture and create an environment for experimenting with new ideas for NPU. Cycle-accurate simulator is often used at the hardware development stage because it is easy to modify and debug hardware microarchitecture. In this paper, I designed an NPU microarchitecture for an embedded environment and implemented it with a cycle-accurate simulator to build an environment for research about NPU. I made a NPU instruction stream that performs the inference operation of the convolution layer and verified it by processing the convolution layers of VGGNet-16 and MobileNet-v1. To show using this environment to apply new strategies and evaluate performance is available, I applied the IFM data reuse strategy and measured performance change with it. For this, I added a new instruction for the IFM data reuse strategy in NPU and modified the NPU instruction stream. The experiment shows up to 1.326x speedup processing a convolution layer by applying IFM Data Reuse Strategy. Also, memory access to load IFM data in the NPU register reduces by 2.8x by applying IFM data reuse strategy. Through this, I built the NPU microarchitecture for embedded system and environment to evaluate new CNN acceleration ideas.

주제어

학위논문 정보

저자	권구윤
학위수여기관	고려대학교 대학원
학위구분	국내석사
학과	반도체시스템공학과
지도교수	서태원
발행연도	2023
총페이지	77 p
키워드	NPU Cycle-accurate Simulator CNN IFM Data Reuse
언어	kor
원문 URL	http://www.riss.kr/link?id=T16653404&outLink=K
정보원	한국교육학술정보원

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명(한글), 저자명(한글), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문) 관리번호, 논문명(한글), 논문명(영문), 저자명(한글), 저자명(영문), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문)
저장형식	Text(ASCII format) Excel format
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

[학위논문] NPU Simulator 설계 및 Convolution 연산 Acceleration 평가
Design of NPU Simulator and Evaluation of Convolution Acceleration 원문보기

초록 ▼
AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

[학위논문] NPU Simulator 설계 및 Convolution 연산 Acceleration 평가 Design of NPU Simulator and Evaluation of Convolution Acceleration 원문보기

초록 ▼ 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

[학위논문] NPU Simulator 설계 및 Convolution 연산 Acceleration 평가
Design of NPU Simulator and Evaluation of Convolution Acceleration 원문보기

초록 ▼
AI-Helper