[논문]Data-driven inventory management modeling using entropy-regularized deep reinforcement learning

정지원

Data-driven inventory management modeling using entropy-regularized deep reinforcement learning 원문보기

정지원 (Graduate School, Yonsei University Department of Industrial Engineering 국내박사)

초록 ▼
AI-Helper

효과적인 재고 관리 시스템은 기업 성공 좌우에 무시못할 영향을 끼치게 됩니다. 의류 산업과에도 소매점의 효과적인 재고 관리는 기업이 높은 보관 비용이 발생하지 않도록 적절하고 충분한 재고량으로 품목이 고갈되지 않고 안전한 보관 수준을 유지하는 과정으로 경쟁력 및 고객 충성도를 유지하는 핵심 성공 요인입니다. 그러나 불행하게도 경쟁력 있는 재고 관리 시스템은 도전 없이는 이루기가 어렵습니다. 4차 산업혁명의 도래와 함께 자율적으로 비실용적인 산업 문제를 해결하기 위해 인공지능 전문성의 필수가 높아졌습니다.
인공지능 연구는 기계 학습을 기반으로 하며 그 중에 강화학습도 포함됩니다. 변수의 차원 및 문제의 규모가 점점 현실화됨에 따라 심층강화학습이 등장했습니다. 연구 단계의 초시점에서는 Atari 게임을 시뮬레이션하기 위해 많이 사용하게 됐습니다. Atari 게임에서 고성능 결과의 달성을 보여 본 게임은 점진적으로 개발된 최첨단 심층 강화 학습 모델 개발에 프레임워크가 되어줬습니다. 본 연구에서는 Soft Ac-tor-Critic에 중점을 두고 의류 Make-to-stock 재고 관리 문제를 해결하기 위해 심층강화학습 방법에 대한 연구를 수행했고 또한, 비용 부분에서 효율적인 재고 관리를 더욱 촉진하기 위해 Total Penalty로 지명된 새로운 보상 함수를 제안합니다.
본 연구에서 사용된 알고리즘은 재고 관리의 가장 중요한 핵심 성과 지표 중 일부인 Service Level, Sell-through Rate 및 Inventory-to-sales ratio 기반으로 서로 평가됩니다. Soft Actor-Critic 이 가장 낮은 총 비용으로 재고량의 흐름을 유지하고 수요를 충족하는 데 있어서 제일 뛰어난 성능을 보였습니다. 핵심 성과 지표 측면에서 Soft Actor-Critic은 과잉 재고 없이 판매할 수 있는 재고의 가용성을 높임으로써 서비스 수준과 판매율 간의 균형을 개선하면서 재고 관리 시스템을 운영했습니다.
Soft Actor-Critic은 Twin Delayed Deep Deterministic Policy Gradient(총 비용을 보상 함수로 학습) 및 (R, Q) – Policy (총 페널티를 보상 함수로 학습) 보다 2.42% 및 31.91% 더 낮은 총 비용을 달성했습니다. 또한 SAC는 (R, Q) – Policy 보다 81.39% 낮은 재고 대비 매출 비율을 달성하여 재고 재고를 가용한 상태로 판매하고 비용을 최적화하는 더 우수한 능력을 보여줬습니다.

Abstract ▼ AI-Helper

The success of enterprises heavily relies on their inventory management systems, which largely determine their competitiveness and customer loyalty. This case is also relevant in the garment industries. The primary goal of establishing a successful inventory management system is to maintain safe storage level of items without risking depletion, thereby avoiding excessive storage costs. Unfortunately, achieving a competitive inventory management system poses challenges. With the emergence advent of the Fourth Industrial Revolution, expertise in Artificial Intelligence has become essential for addressing complex industrial problems autonomously.
The study of Artificial Intelligence is founded on Machine Learning, and Reinforcement Learning is part of it. As the dimensions of variables and complexity of problems increase, Deep Reinforcement Learning has emerged by combining Deep Learning and Reinforcement Learning. Its initial prominence was seen in the simulation of Atari games, where it served as framework for developing state-of-the-art Deep Reinforcement Learning models by achieving high-performance results. This study focuses on applying Deep Reinforcement Learning methods to address an apparel Make-to-stock inventory management focusing specially on Soft Actor-Critic. Additionally, a novel reward function called Total Penalty is proposed to enhance cost-efficient inventory management.
The models are evaluated based on critical Key Performance Indicators, including sell-through rate, service level, and inventory-to-sales ratio. Soft Actor-Critic, in both scenarios when trained with Total Cost and Total Penalty as reward functions, demonstrated superior inventory management performance by meeting demands at the lowest Total Cost. Furthermore, Soft Actor-Critic achieved a favorable balance between service level and sell-through rate within large span by ensuring optimal stock availability without excessive overstocking.
Compared to Twin Delayed Deep Deterministic Policy Gradient (with Total Cost as reward function), and (R, Q) – policy (with Total Penalty as reward function) models, Soft Actor-Critic achieved a 2.42% and 31.91% lower Total Cost, respectively. Additionally, Soft Actor-Critic achieved an 81.39% lower inventory-to-sales ratio than (R, Q) – policy, indicating its superior ability to optimize costs and make inventory stocks available for sales.

주제어

학위논문 정보

저자	정지원
학위수여기관	Graduate School, Yonsei University
학위구분	국내박사
학과	Department of Industrial Engineering
지도교수	Wooju Kim
발행연도	2023
총페이지	viii, 84장
키워드	Machine Learning Deep Reinforcement Learning Make-to-Stock Inventory Management Soft Actor-Critic 머신러닝 심층 강화학습 재고 관리 생산 방식 액터 크리틱
언어	eng
원문 URL	http://www.riss.kr/link?id=T16809838&outLink=K
정보원	한국교육학술정보원

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명(한글), 저자명(한글), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문) 관리번호, 논문명(한글), 논문명(영문), 저자명(한글), 저자명(영문), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문)
저장형식	Text(ASCII format) Excel format
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

Data-driven inventory management modeling using entropy-regularized deep reinforcement learning 원문보기

초록 ▼
AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

Data-driven inventory management modeling using entropy-regularized deep reinforcement learning 원문보기

초록 ▼ 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

초록 ▼
AI-Helper