Movie Recommendation System based on Latent Factor Model

The Journal of the Korea Institute of Electronic Communication Sciences, vol. 16, no. 1, 2021, pp. 125-134

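(Department of Computer Engineering, Graduate School, Chonnam National University), 김강철 (School of Electrical, Electronic and Computer Engineering, Chonnam National University)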

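With the rapid development of the film industry, the number of films produced is increasing sharply, and movie recommendation systems help audiences choose films by predicting their preferences from their past behavior or movie reviews. This paper proposes a movie recommendation system based on the latent factor model that uses the average rating and bias correction. Singular value decomposition is used to factorize the rating matrix, and stochastic gradient descent is used to fit the parameters of the least-squares loss function. The root mean square error (RMSE) is used to evaluate the performance of the proposed system. The proposed system is implemented with the Surprise package, and the simulation results show an RMSE of 0.671, which is better than the methods reported in other papers.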

With the rapid development of the film industry, the number of films is increasing significantly, and a movie recommendation system can help users by predicting their preferences based on their past behavior or feedback. This paper proposes a movie recommendation system based on the latent factor mo...


Proposed Method

  • the huge data sparseness. In this paper, the latent factor model adds the overall average rating, user bias, movie bias, and an adjustment of the rating prediction range. In addition, the Root Mean Square Error (RMSE) is calculated because it is one of the most commonly used indicators of prediction accuracy.
  • Item-based collaborative filtering focuses on the similarities between the items that a user rates. Model-based collaborative filtering fits a model to users' previous ratings and makes recommendations from the predictions of the fitted model. The typical model-based collaborative filtering technique is the latent factor model, whose main idea is to map both movies and users into the same latent factor space.
  • The proposed model obtains the final adjusted rating matrix after iterating over all of the training data. The last step is to recommend 10 movies to each user based on the predicted rating matrix.
  • This paper focuses on the sparse data challenge faced by movie recommendation systems. The rating predicted by the latent factor model is improved by adding the overall average rating and the biases. In addition, the ratings predicted by the proposed model are constrained to the range 0-5, which gives more accurate results.
  • This paper proposes an improved latent factor model that adjusts for the overall average rating, user bias, movie bias, and the prediction rating range. The MovieLens datasets are merged to build a rating matrix.
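
The rating prediction and the recommendation step described in the points above can be illustrated with a minimal Python sketch. It assumes the common biased form of the latent factor model, in which the predicted rating is the overall average plus the user bias, the movie bias, and the dot product of the user and movie factor vectors, clipped to the range 0-5. The function names, variable names, and toy values are illustrative assumptions and are not taken from the paper.

```python
# Sketch only: all names and toy values below are illustrative assumptions.

def predict_rating(mu, b_u, b_i, p_u, q_i, low=0.0, high=5.0):
    """Biased latent factor prediction, clipped to the rating range."""
    raw = mu + b_u + b_i + sum(pf * qf for pf, qf in zip(p_u, q_i))
    return min(high, max(low, raw))


def top_n(user, movies, mu, user_bias, movie_bias, P, Q, seen, n=10):
    """Return up to n movies the user has not rated, best prediction first."""
    scored = [
        (predict_rating(mu, user_bias[user], movie_bias[m], P[user], Q[m]), m)
        for m in movies
        if m not in seen
    ]
    # Sort by descending predicted rating; break ties by movie id.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [m for _, m in scored[:n]]


# Toy model: overall average, biases, and two latent factors per user/movie.
mu = 3.5
user_bias = {"u1": 0.25}
movie_bias = {"m1": 0.5, "m2": -0.5, "m3": 1.0}
P = {"u1": [1.0, 0.5]}
Q = {"m1": [0.5, 0.5], "m2": [-1.0, 0.0], "m3": [1.0, 1.0]}

ranking = top_n("u1", ["m1", "m2", "m3"], mu, user_bias, movie_bias, P, Q,
                seen={"m1"}, n=10)
print(ranking)
```

In this toy example the unclipped score for "m3" exceeds 5, so the range adjustment caps it at 5.0 before ranking.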


  • GridSearchCV tries all parameter combinations and reports the parameters that give the best measure. The tuning parameters are selected with 5-fold cross-validation, using RMSE as the measure. Table 6 shows the simulation results have the best RMSE 0.
  • The least-squares loss function is optimized with the stochastic gradient descent (SGD) algorithm. The algorithm updates the parameters by taking steps in the negative direction of the gradient of the objective function, which leads towards a local minimum, as given in equations (2) to (5).
  • optimization part. The proposed recommendation system runs a cross-validation procedure for a given algorithm and reports accuracy measures; RMSE is used as the accuracy measure.
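
The SGD optimization and RMSE evaluation described in the points above can be sketched in plain Python as follows. This is a minimal illustration and not the paper's implementation: the L2 regularization term, the learning rate, the number of factors and epochs, and the toy ratings are all assumptions, and equations (2) to (5) of the paper are not reproduced here.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def train_sgd(ratings, n_factors=2, n_epochs=200, lr=0.01, reg=0.02, seed=0):
    """Fit the average, biases and latent factors by SGD on a squared-error loss.

    The L2 penalty (reg) is an assumption added for this sketch.
    """
    rng = random.Random(seed)
    mu = sum(r for _, _, r in ratings) / len(ratings)
    users = sorted({u for u, _, _ in ratings})
    items = sorted({i for _, i, _ in ratings})
    b_u = {u: 0.0 for u in users}
    b_i = {i: 0.0 for i in items}
    P = {u: [rng.gauss(0, 0.1) for _ in range(n_factors)] for u in users}
    Q = {i: [rng.gauss(0, 0.1) for _ in range(n_factors)] for i in items}
    for _ in range(n_epochs):
        order = ratings[:]
        rng.shuffle(order)  # visit the ratings in random order each epoch
        for u, i, r in order:
            err = r - (mu + b_u[u] + b_i[i] + dot(P[u], Q[i]))
            # Step against the gradient of the regularized squared error.
            b_u[u] += lr * (err - reg * b_u[u])
            b_i[i] += lr * (err - reg * b_i[i])
            for f in range(n_factors):
                puf, qif = P[u][f], Q[i][f]
                P[u][f] += lr * (err * qif - reg * puf)
                Q[i][f] += lr * (err * puf - reg * qif)
    return mu, b_u, b_i, P, Q


def predict(model, u, i):
    """Predicted rating, clipped to the range 0-5."""
    mu, b_u, b_i, P, Q = model
    return min(5.0, max(0.0, mu + b_u[u] + b_i[i] + dot(P[u], Q[i])))


def rmse(model, ratings):
    """Root mean square error of the model on a list of (user, item, rating)."""
    sq = sum((r - predict(model, u, i)) ** 2 for u, i, r in ratings)
    return math.sqrt(sq / len(ratings))


# Toy ratings (user, movie, rating); values are made up for illustration.
toy = [("u1", "m1", 5.0), ("u1", "m2", 3.0), ("u1", "m3", 4.0),
       ("u2", "m1", 4.0), ("u2", "m2", 2.0),
       ("u3", "m1", 3.0), ("u3", "m2", 1.0), ("u3", "m3", 2.5)]

baseline_rmse = rmse(train_sgd(toy, n_epochs=0), toy)  # no SGD steps taken
trained_model = train_sgd(toy)
trained_rmse = rmse(trained_model, toy)
print(baseline_rmse, trained_rmse)
```

The sketch measures RMSE on the training ratings only, to show that the SGD steps reduce the loss. The Surprise package provides its own `SVD` algorithm and `surprise.model_selection.GridSearchCV` for cross-validated tuning; the code above is independent of that library.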

