[논문]시계열 분석을 이용한 서울시 미세먼지 농도 예측

오세랑

시계열 분석을 이용한 서울시 미세먼지 농도 예측
Prediction of PM10 Concentration in Seoul using Time Series Decomposition 원문보기

오세랑 (전남대학교대학원 전기및반도체공학과 국내석사)

초록 ▼
AI-Helper

현재 한국을 포함한 아시아권에서는 미세먼지로 인한 질병이 심각하다. 특히 미세먼지로 인한 호흡기 관련 질병 발생률이 극심하여 미세먼지 발생억제와 미세먼지 농도 예측에 관한 연구들이 진행되어왔다. 예측 대상인 미세먼지 농도는 기상적 요소에 영향을 받기 때문에, 특정 기상요소들을 바탕으로 시계열 데이터를 구성하여 예측을 진행해왔다. 하지만 기존의 연구들은 시계열 데이터를 구성하여 시계열 특성에 적합한 알고리즘들을 사용하지만, 이는 단순히 데이터 취득 후, 알고리즘에 대입하는 과정의 반복으로 시계열 데이터의 특성을 제대로 사용하지 못했다. 이러한 기존의 미세먼지 농도 예측과정의 첫 번째 문제는 입력과 출력을 포함하는 데이터 간의 상관관계에 대한 정량적 지표의 언급이 없다는 점이다. 그리고 두 번째는 시계열의 특성을 가지는 데이터를 구성했지만, 그 특성에 적합한 시계열 데이터 분석이 이루어지지 않았다는 점이다. 세 번째는 알고리즘의 검증지표를 통한 정확도와 오차의 계산은 진행되지만, 잔차 진단을 통해 실질적인 모델의 발전을 위한 개선방안을 제시하거나 종료를 선언하는 과정이 없다. 이는 잔차 진단을 통해 모델의 개선 여지의 유무를 파악하고, 개선의 여지가 있다면 개선방안을 제안하고 개선방안이 없다면 모델의 종료를 선언하는 과정이다.
본 논문에서는 심층 순환신경망(deep recurrent neural network, DRNN)을 구현하여 서울시의 미세먼지 농도를 예측하는 모델을 구축한다. 서울시의 미세먼지 농도 예측을 위해 서울시와 인접한 다른 도시들이 영향을 미칠 것으로 가정하고, 그에 합당한 정량적 근거를 제시하여 지역 선정을 진행한다. 미세먼지 농도 예측을 위한 입력 데이터 선정은 미세먼지 발생원과 발생 후의 증감현상, 두 가지 부분으로 나누어 진행하였다. 수집한 데이터를 바탕으로 상관관계 분석을 통해 최종 입력 데이터를 선정했으며, 이렇게 구성된 시계열 데이터를 기반으로 그에 적합한 데이터 분석을 수행한다. 심층 순환신경망은 선정된 입력 데이터를 기반으로 서울시 미세먼지 농도를 예측하며, MAE(mean absolute error), MSE(mean squared error), RMSE(root mean squared error)를 검증지표로 사용하여 알고리즘의 성능을 평가한다. 마지막으로 잔차 진단을 통해 미세먼지 예측 모델의 개선 여부를 파악하고, 개선의 여지가 있다면 수정 및 재학습을 진행하며 개선의 여지가 없다면 모델의 종료를 선언한다.

Abstract ▼ AI-Helper

Currently, diseases caused by PM10 are serious in Asia, including Korea. In particular, many studies have been conducted by suppressing the occurrence of PM10 and predicting the concentration of PM10 due to the extreme incidence of respiratory diseases caused by PM10. Since the concentration of PM10, which is the target of prediction, is affected by meteorological factors, time series data have been constructed and predicted based on specific meteorological factors. The existing studies use algorithms that are suitable for time series characteristics by constructing time series data. However, this is simply repeating the process of substituting the algorithm after the data acquisition. So, the characteristics of time series data were not properly used. The first problem with the existing PM10 concentration prediction process is that there is no quantitative mention of the correlation between data including input and output. Second, the data with the characteristics of time series were constructed, but data analysis suitable for the characteristics was not conducted. Third, the calculation of accuracy and error through the algorithm's verification index is in progress, but there is no process of suggesting improvement measures or declaring termination for the development of the actual model through residual diagnosis. This is the process of identifying the possibilities of improvement of the model through residual diagnosis, suggesting the improvement plan, if there is a possibilities of improvement, and declaring the end of the model, if there is no improvement plan.
In this paper, a model for predicting the concentration of PM10 in Seoul is constructed by implementing a deep recurrent neural network (DRNN). In order to predict the concentration of PM10 in Seoul, it is assumed that other cities adjacent to Seoul will have an impact, and regional selection is carried out by presenting a reasonable quantitative basis. The selection of input data for predicting the concentration of PM10 was divided into two parts: the source of PM10, and the increased, and decreased phenomenon after the occurrence. Based on the collected data, the final input data was selected through correlation analysis, and based on the time series data configured in this way, appropriate data analysis is performed. The deep circulating neural network predicts the concentration of PM10 in Seoul based on the selected input data, and evaluates the performance of the algorithm using mean absolute error(MAE), mean square error (MSE), and root mean square error (RMSE) as verification indicators. Finally, the PM10 prediction model is identified through residual diagnosis, and if there is a possibilities of improvement, correction and re-learning are conducted, and if there is no possibilities of improvement, the model will be declared to the end.

주제어

학위논문 정보

저자	오세랑
학위수여기관	전남대학교대학원
학위구분	국내석사
학과	전기및반도체공학과
지도교수	배영철
발행연도	2022
총페이지	81p.
키워드	심층 순환신경망 시계열 분석 데이터 해석 RMSE 모델 검증
언어	kor
원문 URL	http://www.riss.kr/link?id=T16153724&outLink=K
정보원	한국교육학술정보원

표제어: PCR

동의어: Packet Collision Rate

용어 설명 출처 목록 (6)

용어 설명: PCR은 세균 특이성이 있는 primer를 이용하여 적은 수의 세균이 있을지라도 쉽게 검출할 수 있는 유용한 방법이며, 이를 이용하여 구강 내 치면세균막이나 타액에서 직접 세균을 검출할 수 있게 되었다[8].

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명(한글), 저자명(한글), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문) 관리번호, 논문명(한글), 논문명(영문), 저자명(한글), 저자명(영문), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문)
저장형식	Text(ASCII format) Excel format
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

시계열 분석을 이용한 서울시 미세먼지 농도 예측
Prediction of PM10 Concentration in Seoul using Time Series Decomposition 원문보기

초록 ▼
AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

시계열 분석을 이용한 서울시 미세먼지 농도 예측 Prediction of PM10 Concentration in Seoul using Time Series Decomposition 원문보기

초록 ▼ 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

시계열 분석을 이용한 서울시 미세먼지 농도 예측
Prediction of PM10 Concentration in Seoul using Time Series Decomposition 원문보기

초록 ▼
AI-Helper