[논문]Attention 기반의 3차원 UNet을 통한 비디오 이상치 탐지

조준흠

Attention 기반의 3차원 UNet을 통한 비디오 이상치 탐지
Video Anomaly Detection via Attention based 3D UNet 원문보기

조준흠 (연세대학교 일반대학원 컴퓨터과학과 국내석사)

초록 ▼
AI-Helper

지능형 비디오 감시 시스템의 이상탐지는 중요하고 지속적인 과제이다. 이상 탐지 모델은 연속적 비디오 프레임 내에서 개별 객체들의 정상적인 패턴에서 벗어나는 이벤트를 탐지해야 한다. 그러나 2D conv 연산으로 구성된 생성적 적대 네트워크(GAN)을 사용한 미래 프레임 예측 방법은 입력의 채널을 연결하여 입력으로 사용하고, 2D conv 연산은 각 채널을 더하게 되므로 연속적인 비디오 프레임에서 순차적 정보를 학습하기 어렵다. 이 문제를 해결하기 위해 연속된 비디오 내에서 각 객체를 감지하고, attention 알고리즘과 3D conv 연산으로 각 객체의 시공간 정보를 포착하고 비디오의 순차적인 정보를 학습하여 미래 프레임을 예측하는 U-Net 모델을 제안한다. 우리는 YOLOv5를 사용하여 비디오 프레임 내의 객체를 탐지하여 불필요한 배경에 대한 훈련을 최소화하고, 탐지된 객체의 bounding box 영역을 스케일링하여 각 객체의 정보를 최대화하여 모델을 훈련시킨다. 우리는 개체에 대한 스케일링의 중요성을 보여주기 위해 ablation study를 수행한다. 또한, 우리는 포착된 객체의 bounding box 크기가 다르기 때문에 모델 훈련을 위해 동일한 크기로 변환하면서 작은 이미지를 흐리게 하는 문제를 해결하기 위해 가중 피크 신호 대 잡음비(PSNR)를 제안한다. 우리의 모델은 세 가지 벤치마크(USCD Ped2, Avenue, Shanghai Tech)에서 최첨단 방법과 비교하여 우수한 결과를 달성한다.

Abstract ▼ AI-Helper

Anomaly detection in intelligent video surveillance systems is an important and ongoing task. An anomaly detection model should identify events that deviate from the normal pattern of individual objects within sequential video frames. But existing prediction methods of future frames, such as generative adversarial networks (GANs) using 2D convolution, use channels as inputs by concatenating them, and 2D convolution adds up them, making it difficult to learn sequential information in successive video frames. To address this problem, we propose a model to predict future frames by learning sequential information from previous video sequences by detecting each object within the video sequence and capturing spatio-temporal information of each object with attention and 3D convolution in U-Net. We use YOLOv5 to detect objects within a video frame to minimize training on unnecessary backgrounds, and train the model by maximizing the information of each object by scaling the bounding box area of the detected object. We conduct an ablation study to show the importance of scaling on objects. Additionally, we propose a weighted peak signal-to-noise ratios (PSNR) to solve the problem of blurring small images while converting to the same size for model training because the captured objects have different bounding box sizes. To the best of our knowledge in concerning predictive models, our model achieves superior results compared with state-of-the-art methods on three benchmarks (USCD Ped2, Avenue, and Shanghai Tech).

주제어

학위논문 정보

저자	조준흠
학위수여기관	연세대학교 일반대학원
학위구분	국내석사
학과	컴퓨터과학과
지도교수	박상현
발행연도	2023
총페이지	v, 42 p.
키워드	Video anomaly detection attention object detection future frame prediction
언어	eng
원문 URL	http://www.riss.kr/link?id=T16627403&outLink=K
정보원	한국교육학술정보원

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명(한글), 저자명(한글), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문) 관리번호, 논문명(한글), 논문명(영문), 저자명(한글), 저자명(영문), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문)
저장형식	Text(ASCII format) Excel format
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Attention 기반의 3차원 UNet을 통한 비디오 이상치 탐지
Video Anomaly Detection via Attention based 3D UNet 원문보기

초록 ▼
AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Attention 기반의 3차원 UNet을 통한 비디오 이상치 탐지 Video Anomaly Detection via Attention based 3D UNet 원문보기

초록 ▼ 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

Attention 기반의 3차원 UNet을 통한 비디오 이상치 탐지
Video Anomaly Detection via Attention based 3D UNet 원문보기

초록 ▼
AI-Helper