Determining maintenance policies for partially observable multi-component systems with deep reinforcement learning

Karabag, Oktay

Volume: 31, Issue: 1, Year: 2025

Pamukkale University Journal of Engineering Sciences [Pamukkale Univ Muh Bilim Derg]

Pamukkale Univ Muh Bilim Derg. Ahead of Print: PAJES-33969 | DOI: 10.5505/pajes.2024.33969

Oktay Karabag
İzmir University of Economics

This study investigates maintenance decisions for partially observable multi-component systems. Such systems typically operate where the service provider is remote and the wear levels of the components cannot be fully monitored with sensors. Wind turbines are a good example. For such systems, the service provider must decide not only when to perform a maintenance intervention, but also which spare parts to bring to the maintenance site and which components to replace after the on-site inspection. We model this complex decision problem as a partially observable Markov decision process and obtain numerical solutions with the actor-critic reinforcement learning method. Our numerical studies show that the policies obtained with the reinforcement learning algorithm outperform several heuristic maintenance policies that are frequently used in practice and well known in the literature. In some cases, these policies reduce cost by 10-15% on average relative to the heuristic policies. The advantage of the reinforcement learning solution over the heuristic policies also grows as the corrective maintenance cost, the emergency order cost, and the cost of returning excess spare parts increase.

Keywords: Partially observable multi-component systems, Partially observable Markov decision processes, Reinforcement learning methods, Condition-based maintenance problems.
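
As an illustration of what partial observability means here, the sketch below shows the Bayesian belief update that is a standard building block of partially observable Markov decision processes: the decision maker keeps a probability distribution over a component's hidden wear level and revises it after each noisy sensor reading. It is not the model from the paper. The three wear levels, the transition matrix, the sensor likelihoods, and the function name `belief_update` are all hypothetical values chosen for illustration, and only a single component is shown.

```python
from typing import List

Matrix = List[List[float]]


def belief_update(belief: List[float], transition: Matrix,
                  obs_likelihood: Matrix, observation: int) -> List[float]:
    """One Bayes-filter step for a hidden wear level (hypothetical helper)."""
    n = len(belief)
    # Prediction: push the current belief through the wear transition matrix.
    predicted = [sum(belief[i] * transition[i][j] for i in range(n))
                 for j in range(n)]
    # Correction: weight each wear level by how likely it makes the reading.
    unnormalized = [predicted[j] * obs_likelihood[j][observation]
                    for j in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [u / total for u in unnormalized]


# Assumed wear levels: 0 = good, 1 = worn, 2 = failed (illustrative only).
TRANSITION = [
    [0.80, 0.15, 0.05],
    [0.00, 0.70, 0.30],
    [0.00, 0.00, 1.00],
]
# Assumed noisy sensor: column 0 = "normal" reading, column 1 = "alarm".
OBS_LIKELIHOOD = [
    [0.9, 0.1],
    [0.4, 0.6],
    [0.1, 0.9],
]

# Start from a component known to be new, then observe one alarm reading.
belief = belief_update([1.0, 0.0, 0.0], TRANSITION, OBS_LIKELIHOOD, observation=1)
print([round(b, 3) for b in belief])
```

In a reinforcement learning setting, a belief vector of this kind (or a history of observations) could serve as the input to the actor and critic networks. Whether the paper uses this representation is not stated in the abstract.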

Determining maintenance policies for partially observable multi-component systems with deep reinforcement learning methods

Oktay Karabag
İzmir Ekonomi Üniversitesi

In this study, maintenance/repair decisions for partially observable multi-component systems are examined. Systems of this type are generally operated under conditions where the service provider is far away, and the wear levels of the components usually cannot be fully monitored with the help of sensors. Wind turbines are an example that fits such systems exactly. In these systems, the service provider decides when to perform maintenance/repair, which parts to dispatch to the maintenance point together with the maintenance decision, and which system components need to be replaced following the inspection at the maintenance point. In our study, this complicated decision problem is modeled as a partially observable Markov decision process, and the related numerical solutions are obtained using the actor-critic reinforcement learning method. Our numerical studies have shown that the solutions obtained with the reinforcement learning algorithm give better results than the heuristic maintenance/repair policies widely used in practice and in the literature. In some cases, these solutions were observed to provide an improvement of 10-15% on average. In addition, it was determined that as the corrective maintenance cost, the emergency order cost, and the cost of returning excess spare parts increase, the solutions obtained with the reinforcement learning algorithm provide a greater advantage over the other heuristic policies.

Keywords: Partially observable multi-component systems, partially observable Markov decision processes, reinforcement learning methods, condition-based maintenance problems.

Corresponding Author: Oktay Karabag, Türkiye
Manuscript Language: Turkish
