746
2 NEUROEVOLUTION
Neuroevolution is an artificial intelligence method
that uses evolutionary algorithms (EA) to generate
artificial neural networks (ANN), its parameters,
topology and rules. Such combination gives the
advantage of flexibility and adaptability, which
allows to adjust the computational structures to the
dynamicallychangingconditionsencounteredduring
ship maneuvering and
are intensively studied and
implementedin many differentfields of science and
research, including robotics (Lee et al., 2013),
automation processes (Stanley et al., 2005), multi‐
agent systems (Nowak et al., 2008) designing and
diagnostics(Larkinetal.,2006),andmanymore.
Neuroevolutionary algorithms are successful
methods for optimizing neural
networks topologies
fordynamiccontinuousreinforcementlearningtasks.
Their significant advantage over gradient‐based
algorithms is the capability to modify network
topologies along with connection weights, resulting
withbroadersearchspaceofpossiblesolutions.
The proper maneuvers of ship maneuvering
accordingtoCOLREGsrulesisessentialtothesafety
of people,
equipment, cargo and the environment.
Increaseofcomputationalpowerofelectronicdevices
allows to implement as complex algorithms as
neuroevolution into advanced decision support
systems(DSS)alsointhefieldofmarinenavigation.
Throughcontinuousenvironmentobservationand
learning process, such DSS shall predict the vessels’
rudderangleandpropeller
revolutionsasaccurate as
possibletoensuresafeimplementationofchosenrule.
It is possible to calculate these output signals when
thereisasimulationmodelofthevesselavailable.
InneuroevolutionANNistreatedasanindividual
inapopulationofmultiplenetworks.Neuroevolution
is able to find a solution
of a complex and
dynamically changing task with ANN created and
modifiedwithEA. Thebasictopologiesof theinitial
populationarerandomlydeterminedatthebeginning
oflearningprocess.Eachindividualactsasabrainof
artificialhelmsmanandbeginstheprocessoffinding
a solution with the same
starting parameters (ships
initial position, course, velocity, rudder angle, rpm).
Theactionofeachindividualisusuallyassessedwith
thereinforcementlearningalgorithms(Stanleyetal.,
2005)andevolutionarystageofthesystemshallselect
individuals best suited to the task during selection
stage, which determines the whole population to
improve
itsgeneticmaterialovertime.
Evaluation of each individual is being processed
duringwholesimulationaftersomeimportantevents
tookplace,asforexample:
movingthevesseloutoftheareaoronforbidden
sector, i.e. the safety domain of an encountered
vessel,
making rapid and/or frequently changing
maneuvers,i.e.tofrequentalterationofrpm,
leadingtoimproper ships’ movement parameters
values,i.e.linearand/orangularvelocity too low
ortoohigh,
movingthevesselawayfromgoal,
reachingagoal.
Alltheseeventmustbearbitrarilyrated,resulting
inarewardtoevaluatedindividual,thus
valuatingits
fitness important in evolutionary stage of the
algorithm and consequently his chance of
reproductionandsurvivaltothenextgeneration.
Evolutionary process of the system consists three
mainsteps:
selectionofthebestindividualorindividuals,
reproduction (with cross‐over and mutation sub‐
processes),
replacement
(offspring replaces worst
individuals).
Forthepurposeofthistasktheneuroevolutionary
method, the modified NEAT algorithm, with direct
encoding of neural network topology has been
implemented.
NEAT (NeuroEvolution of Augmenting
Topologies) (Stanley and Risto, 2002) adjust the
topology of ANN’s with EA gradually, allows to
obtainasetofindividuals
thatarebestfittedtogiven
task.
Input and output signals of ANN’s have been
determinedatthebeginningofdesigningphaseofthe
system.Properlydesignedsetofsignalsconsideredin
the model is crucial for efficiency of the system as
muchasforitsfidelityandaccuracyin
comparisonto
therealnavigationalsituation.
Inputsignalsinthesystem,withthreedegreesof
freedomofthevesselmovement,areasfollows:
ships’courseoverground,
ships’angularvelocity,
ships’speedoverground,
ships’position,
ships’distanceandangletogoal,obstaclesandto
the
encounteredvessel,
mainpropellerrevolutions(currentandpreset),
rudders’deflection(currentandpreset).
Infutureresearchothersignalsfromenvironment
maybetakenintoaccount,i.e.wind,current,waves,
cargo,trimandroll,ifdeliveredinashipmodel.
Outputsignals of ANNs generates the values for
steering
thevessel:
rpmofmainpropeller,
rudders’deflection.
Alloftheinputandoutputsignalsarenormalized
andencodedasrealvaluesbetween0and1.
Each node in ANN represents a neuron that
producesarealvaluebetween0and1asaresultof
normalized weighted sum
of its inputs.
Normalization of weighted sum is performed with
sigmoidfunction.
The simulation results are shown below for
simulationmodel of three‐degrees‐of‐freedom VLCC
crude oil tanker “Esso Norway” with the single‐
propellerandsingle‐rudder(Figure1).