[Thesis] Comparison of Machine Learning Prediction Models for Housing Tenure and Selling Decisions in the Korean Housing Market

김은미

Comparison of Prediction Models Using Machine Learning of Housing Tenure and Decision-making on Housing Sales in the Korean Housing Market

김은미 (Hansung University Graduate School, Department of Economics and Real Estate, Real Estate Economics; domestic doctorate)

Abstract (translated from the Korean)

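This study applies an OLS model to estimate the determinants of the housing holding period, then compares the predictive power of machine learning models including SVM, Decision Tree, Random Forest, Gradient Boosting, XGBoost, and LightGBM. Building on the machine learning models with high predictive power, a Stacking method is applied to construct a model with still higher predictive power. The study differs from prior research in that it allows the volume of housing transactions in the housing market to be gauged. The OLS analysis showed that sale profit, housing price, number of household members, and, among housing types, detached houses and apartments affect the holding period.
Comparing the RMSE of each machine learning model with that of OLS, the machine learning models had lower RMSE, indicating higher predictive power than OLS. The data were then rebuilt using the variables that affect the holding period, and each machine learning model was applied and compared again; Random Forest showed the best predictive power. In addition, the models with the highest predictive power (Random Forest, Decision Tree, Gradient Boosting, and XGBoost) were used as base models, and a Stacking model was built with Linear, Ridge, and Lasso models as meta-models. The RMSE was lowest, at 0.5868, with the Ridge meta-model, confirming that it had the highest predictive power.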
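The stacking procedure described above can be illustrated with a minimal sketch. It is not the thesis's implementation: the two base learners (a 1-D least-squares line and a k-nearest-neighbour average) are deliberately simple stand-ins for Random Forest, Decision Tree, Gradient Boosting, and XGBoost, the data are synthetic, and every function name is hypothetical. What it does show is the mechanism: out-of-fold predictions from the base models become the input features of a Ridge meta-model, and models are compared by RMSE.

```python
import math
import random

def rmse(y_true, y_pred):
    """Root mean squared error, the metric used to rank the regression models."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

# Stand-in base learners (assumptions for illustration, not the thesis's models).
def fit_linear(xs, ys):
    """1-D least-squares line; returns a predict function."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return lambda x: my + slope * (x - mx)

def fit_knn(xs, ys, k=5):
    """Average of the k nearest training targets; returns a predict function."""
    pairs = list(zip(xs, ys))
    def predict(x):
        nearest = sorted(pairs, key=lambda p: abs(p[0] - x))[:k]
        return sum(y for _, y in nearest) / len(nearest)
    return predict

def fit_ridge(rows, ys, alpha=1.0):
    """Ridge meta-model on exactly two centred features (closed-form 2x2 solve)."""
    n = len(ys)
    m0 = sum(r[0] for r in rows) / n
    m1 = sum(r[1] for r in rows) / n
    my = sum(ys) / n
    centred = [(r[0] - m0, r[1] - m1) for r in rows]
    s00 = sum(u * u for u, _ in centred) + alpha
    s11 = sum(v * v for _, v in centred) + alpha
    s01 = sum(u * v for u, v in centred)
    b0 = sum(u * (y - my) for (u, _), y in zip(centred, ys))
    b1 = sum(v * (y - my) for (_, v), y in zip(centred, ys))
    det = s00 * s11 - s01 * s01  # positive because alpha > 0
    w0 = (b0 * s11 - b1 * s01) / det
    w1 = (b1 * s00 - b0 * s01) / det
    return lambda r: my + w0 * (r[0] - m0) + w1 * (r[1] - m1)

def stack(x_train, y_train, x_test, fitters, n_folds=5, alpha=1.0):
    """Stacking: out-of-fold base predictions are the meta-model's features."""
    n = len(x_train)
    oof = [[0.0] * len(fitters) for _ in range(n)]
    for fold in range(n_folds):
        held = [i for i in range(n) if i % n_folds == fold]
        kept = [i for i in range(n) if i % n_folds != fold]
        for j, fit in enumerate(fitters):
            model = fit([x_train[i] for i in kept], [y_train[i] for i in kept])
            for i in held:
                oof[i][j] = model(x_train[i])
    meta = fit_ridge(oof, y_train, alpha)
    # Refit each base learner on all training data for test-time predictions.
    full = [fit(x_train, y_train) for fit in fitters]
    return [meta([m(x) for m in full]) for x in x_test]

# Synthetic data (hypothetical): a trend plus a nonlinear component and noise.
random.seed(0)
xs = [random.uniform(0, 10) for _ in range(200)]
ys = [0.5 * x + math.sin(x) + random.gauss(0, 0.3) for x in xs]
x_tr, y_tr, x_te, y_te = xs[:150], ys[:150], xs[150:], ys[150:]

stacked_pred = stack(x_tr, y_tr, x_te, [fit_linear, fit_knn])
stacked_rmse = rmse(y_te, stacked_pred)
linear_rmse = rmse(y_te, [fit_linear(x_tr, y_tr)(x) for x in x_te])
print(f"linear RMSE = {linear_rmse:.4f}, stacked RMSE = {stacked_rmse:.4f}")
```

Using out-of-fold predictions, rather than predictions on the data each base model was trained on, keeps the meta-model from simply rewarding whichever base learner overfits the most.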
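To compare the predictive power of each model for the home-selling decision, logistic regression, Random Forest, XGBoost, LightGBM, Decision Tree, Gradient Boosting, and MLP models were used to analyze the factors that affect homeowners' selling decisions when there is a gain or a loss on the house price.
In the gain group, the variable with the most significant effect on the selling decision in all models was housing area. Housing area is closely related to housing price and can be regarded as a proxy for gain or loss, which appears to be why it strongly affects the selling decision in the gain group. In the loss group, total assets and housing area were found to significantly affect the selling decision when there was a loss on the house price. Comparing predictive power by the Mean Test Score of the machine learning models for each group, MLP showed the highest predictive power.
This study evaluated the machine learning models for each group using accuracy, precision, recall, F1, and ROC_AUC; for the gain and loss groups alike, MLP performed best, with ROC_AUC values of 0.94 and 0.92, respectively.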
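For readers unfamiliar with the evaluation metrics named above, the following minimal sketch computes accuracy, precision, recall, F1, and ROC AUC for a binary "sell / do not sell" classifier. The labels and scores are invented toy values, not data from the thesis, and the function names are hypothetical. ROC AUC is computed here through its rank interpretation: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one, with ties counted as one half.

```python
def roc_auc(y_true, scores):
    """ROC AUC as the share of (positive, negative) pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def evaluate(y_true, scores, threshold=0.5):
    """Threshold the scores, then compute the five metrics listed in the abstract."""
    y_pred = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "roc_auc": roc_auc(y_true, scores),
    }

# Toy example (hypothetical): 1 = owner sold, 0 = owner did not sell.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.4, 0.7, 0.3, 0.2, 0.1]  # predicted probability of selling
metrics = evaluate(y_true, scores)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Accuracy, precision, recall, and F1 depend on the chosen threshold, whereas ROC AUC summarizes ranking quality across all thresholds, which is one reason it is commonly used to compare classifiers.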

Abstract (author's English)

This study used an OLS model to estimate the determinants of housing tenure (the period of housing ownership) and then compared the predictive power of SVM, Decision Tree, Random Forest, Gradient Boosting, XGBoost, and LightGBM models. It differs from preceding studies in that the volume of housing transactions in the housing market can be identified through a Stacking model, one of the ensemble methods. The OLS analysis showed that sales profit, housing price, the number of household members, and the type of residential housing (detached houses and apartments) affected the period of housing ownership. Comparing the machine learning models with OLS by RMSE showed that the machine learning models had higher predictive power. The data were then rebuilt using the influential variables and the machine learning models were compared again; Random Forest had the best predictive power. In addition, the models with the highest predictive power (Random Forest, Decision Tree, Gradient Boosting, and XGBoost) were applied as individual base models, and a Stacking model was built using Linear, Ridge, and Lasso as meta-models. The RMSE was lowest, at 0.5868, with the Ridge meta-model, which therefore gave the most predictive model.
To compare the predictive power of each model for decisions on housing sales, logistic regression, Random Forest, XGBoost, LightGBM, Decision Tree, Gradient Boosting, and MLP models were used to analyze the factors that affect the decision to sell when there is a profit or a loss on the housing price. The results for each model were also compared in terms of predictive power using the ROC_AUC curve.
The analysis of the profit group showed that housing area was the most significant variable in the decision to sell in all models. Housing area refers to the size of a house, which seems to have a large impact on the decision to sell in the profit group because the larger the size, the larger the loss could be. Housing size influenced the selling decisions of both the profit and loss groups, whereas total debt affected only the loss group's. Comparing predictive power through the ROC_AUC values of each model, the predictive power of the machine learning models is considered generally similar because their ROC_AUC values are similar.

Thesis information

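Author	김은미
Degree-granting institution	Hansung University Graduate School
Degree	Doctorate (domestic)
Department	Department of Economics and Real Estate, Real Estate Economics
Advisor	김상봉
Year of publication	2020
Total pages	vii, 94 p.
Keywords	machine learning; deep learning; housing holding period; home-selling decision; ROC_AUC
Language	kor
Full-text URL	http://www.riss.kr/link?id=T15642663&outLink=K
Source	Korea Education and Research Information Service (한국교육학술정보원)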
