Have a personal or library account? Click to login
Reinforcement learning in discrete and continuous domains applied to ship trajectory generation Cover

Reinforcement learning in discrete and continuous domains applied to ship trajectory generation

By: Andrzej Rak and  Witold Gierusz  
Open Access
|Oct 2012

Abstract

This paper presents the application of the reinforcement learning algorithmsto the task of autonomous determination of the ship trajectory during thein-harbour and harbour approaching manoeuvres. Authors used Markovdecision processes formalism to build up the background of algorithmpresentation. Two versions of RL algorithms were tested in the simulations:discrete (Q-learning) and continuous form (Least-Squares Policy Iteration).The results show that in both cases ship trajectory can be found. Howeverdiscrete Q-learning algorithm suffered from many limitations (mainly curseof dimensionality) and practically is not applicable to the examined task. On the other hand, LSPI gavepromising results. To be fully operational, proposed solution should be extended by taking into accountship heading and velocity and coupling with advanced multi-variable controller.

DOI: https://doi.org/10.2478/v10012-012-0020-8 | Journal eISSN: 2083-7429 | Journal ISSN: 1233-2585
Language: English
Page range: 31 - 36
Published on: Oct 31, 2012
Published by: Gdansk University of Technology
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2012 Andrzej Rak, Witold Gierusz, published by Gdansk University of Technology
This work is licensed under the Creative Commons License.