An effective method to use centralized Q-learning in multi-robot task allocation

Ezercan Kay�r, Hatice Hilal

Volume : 30 Issue : 2 Year : 2024

30/2Current Issue Ahead of Print Archive Most Accessed Articles Manuscript Submission

An effective method to use centralized Q-learning in multi-robot task allocation [Pamukkale Univ Muh Bilim Derg]

Pamukkale Univ Muh Bilim Derg. 2021; 27(5): 579-588 | DOI: 10.5505/pajes.2021.90490

An effective method to use centralized Q-learning in multi-robot task allocation

Hatice Hilal Ezercan Kay�r
Pamukkale University, Engineering Faculty, Department of Electrical and Electronics Enginering

The use of Q-learning methods in multi-robot systems is a challenging area. Multi-robot systems have dynamic and partially observable nature because of robot�s independent decision-making and acting mechanisms. Whereas, Q-learning is defined on Markovian environments theoretically. One way to apply Q-learning in multi robot systems is centralized learning. It learns optimal Q-values for state space of overall system and joint action spaces of all agents. In this case, the system can be considered as stationary and optimal solutions can be converged. But, centralized learning requires full knowledge of the environment, perfect inter-robot communication and good computational power. Especially for large systems, the computational cost becomes huge because of exponentially growing learning space size with the number of robots. The proposed approach in this study, subG-CQL, divides the overall system into small-sized sub-groups without adversely affecting the system's task performing abilities. Each sub-group consists of less number of robots performing less tasks and learns in centralized manner for its own team. So, the learning space dimension is reduced to a reasonable level and required communication remains limited to the robots in the same the sub-group. Due the centralized learning is used, it is expected that the successful results are achieved. Experimental studies show that the proposed algorithm provides increase in the task assignment performance of the system and efficient use of system resources.

Keywords: Multi-Robot Systems, Task Allocation, Q-learning, Centralized Learning

�ok robotlu g�rev atama probleminde merkezi Q-��renme kullanmak i�in etkili bir y�ntem

Hatice Hilal Ezercan Kay�r
Pamukkale �niversitesi, M�hendislik Fak�ltesi, Elektrik-elektronik M�hendisli�i B�l�m�

�ok robotlu sistemlerde Q-��renme y�nteminin kullan�m� olduk�a problemlidir. �ok robotlu sistemlerde, robotun ba��ms�z karar verme ve hareket etme mekanizmalar� nedeniyle dinamik ve k�smen g�zlemlenebilir yap�ya sahiptir. Oysa, Q-��renme y�ntemi teorik olarak Markovian olarak nitelendirilebilecek ortamlar �zerinde tan�mlanm��t�r. �ok robotlu sistemlerde Q-��renmeyi uygulaman�n bir yolu, merkezi ��renmedir. Merkezi ��renme, t�m sistemin durum uzay� ve t�m robotlar�n t�mle�ik hareket uzaylar� i�in optimal Q-de�erlerini ��renir. Bu durumda, sistem statik olarak de�erlendirilmekte ve optimal ��z�m yak�nsama m�mk�n olmaktad�r. Ancak, merkezi ��renme, �evre hakk�nda tam bilgi edinmeyi, robotlar aras� iyi bir haberle�me sa�lanmas�n� ve iyi hesaplama g�c� gerektirir. �zellikle b�y�k sistemler i�in, robot say�s�ndaki art��la birlikte �stel b�y�yen ��renme uzay� boyutu nedeniyle hesaplama maliyeti �ok y�ksek olmaktad�r. Bu �al��mada �nerilen yakla��m olan subG-CQL, sistemin g�rev yapma yeteneklerini olumsuz y�nde etkilemeden genel sistemi k��k boyutlu alt gruplara ay�r�r. Her bir alt grup daha az say�da robottan olu�ur, daha az g�rev yapar ve kendi ekibi i�in merkezi bir �ekilde ��renir. B�ylece ��renme alan� boyutu makul bir d�zeye indirilir ve gerekli ileti�im ayn� alt gruptaki robotlarla s�n�rl� kal�r. Merkezi ��renmenin kullan�lmas� nedeniyle ba�ar�l� sonu�lara ula��lmas� beklenmektedir. Deneysel �al��malar, �nerilen algoritman�n sistemin g�rev atama performans�nda art�� ve sistem kaynaklar�n�n verimli kullan�m�n� sa�lad��n� g�stermektedir.

Anahtar Kelimeler: �ok Robotlu Sistemler, G�rev Atama, Q-��renme, Merkezi ��renme

Hatice Hilal Ezercan Kay�r. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Univ Muh Bilim Derg. 2021; 27(5): 579-588

Corresponding Author: Hatice Hilal Ezercan Kay�r, T�rkiye
Manuscript Language: English

TOOLS Full Text PDF Print Download citation RIS EndNote BibTex Medlars Procite Reference Manager Share with email Share Send email to author Similar articles Google Scholar