792
2 DATASOURCEANDPARAMETERANALYSIS
2.1 Datasource
The AIS data of the ships used in this paper is all
provided by ShangHai Maili Marine Technology
Co.,Ltd (http://www.hifleet.com) and stored after
receivinganddecoding.
Theshipsinvolvedinthispaperarealloceanbulk
carriersusedforinternationaliron
oretrade,andthe
information of such ships are included in Lloydʹs
maritime archives. Therefore, in order to avoid the
situationthatship’sAISdataismis‐inputbysailors,
this paper combines Lloydʹs maritime archives to
improvethestaticinformationofAISdata.
Theactualshipmentsize
oftheshipsusedinthis
paperisderivedfromtheLine‐upofportagencyLBH
whichincludesshipname,departureport,departure
time, shipment size, destination, etc. In order to
excludetheimpactofotherfactors,theselectedships
areallbulkcarriersdeliveredfromAustraliadirectly
to China,
and multi‐port unloading ships have been
excluded.
Match the Lloydʹs maritime files with ship AIS
databyMaritimeMobileServiceIdentify(MMSI)and
shipnametogetamorecompleteandaccurateship
staticinformation(name,callsign,MMSI,IMO,vessel
type,designeddraft,length,width,etc.)and
dynamic
information (longitude, latitude, speed, draft, etc.).
Then,matchthetrueshipmentsizeforeachshipwith
Line‐up through the name, departure port and
departuretime.
2.2 Characteristicextraction
AIS data include static information, dynamic
informationandvoyage‐relatedinformation,there is
no obvious linear relationship between all kinds of
information, so we need to select the appropriate
informationfornonlinear modeling, in order to find
the nonlinear mapping relationship between the
ship’s AIS messages and actual shipment size. BP
neural network has obvious advantages in the
modelingofnonlinearrelationsandiswidelyusedin
information fusion (see Hu
X et al 2011) , track
prediction (see Gan S et al 2016; Rong Zhen 2017),
informationrecognition(seeZhangetal2009;Zhuet
al2012)andotherstudies.
After analysis and screening, the following
featuresareselected:
1 Length and width of vessel. It is a major factor
affecting
the shipment size because it’s closely
relatedtotheshipʹscargocapacity.
2 Deadweight ton. Deadweight ton = Displacement
ofthefullyloadedship‐theweightofemptyship.
It can reflect the shipment size when the ship is
fullyloaded.
3 Designeddraft.Itisthedraftwhen
theshipisfully
loaded.
4 Currentdraft. It changeswhen the shipment size
changes.
Theratioofcurrentdrafttodesigneddraftandthe
ratio of shipment size to deadweight ton are
theoreticallypositivelycorrelated.
Taketheabovefivefeaturesasinputsandthetrue
shipment size as output of
BP neural network, and
obtainthemappingrelationshipbetweenthefeatures
and the true shipment size through supervised
learning.
2.3 Dataprocessingandstorage
MatchtheAISdatawiththeLine‐updatathroughthe
shipname,departureportanddeparturetimetoget
the actual shipment size. Store the
AIS data with
actual shipment size in My SQL database, and set
MMSI as the primary key for convenience of query
andchange.Storagestructureisshowninfigure1.
Datapreprocessing:
1 Dynamic data. The draft of the ship is entered
manuallybythecrew,sotheinformationmaybe
entered incorrectly or changed untimely, or the
draftforlastvoyagemayberetained.Forthedata
with unreasonable draft, search the ships’
historicaltrackonhttp://www.hifleet.comthrough
MMSI.Observeitsdraftchangesafterdeparture.If
there is any changes before arriving next port,
choosethelatter.
2 Static data.
The AIS data were matched with
Lloydʹs Marine archives through MMSI and ship
name,soastoimprovetheaccuracyofAISstatic
data. Through preprocessing, the accuracy and
integrityoftrainingdataareguaranteed.
Figure1.Datastoragesample
3 ESTIMATIONMODELOFSHIPMENTSIZE
BASEDONBPNEURALNETWORK
3.1 Networkstructure
BP neural network is a kind of back propagation
learning algorithm (see Yu 2011) proposed by the
teamofscientistsledbyRumelhartandMcMellandin
1986. It can store a large number of input‐output
mode
mapping relations. Its learning rule is the
steepestdescentmethod.Theweightsandthresholds
of the network are continuously adjusted by back
propagation to minimize the network errors. BP
neural network consists of a series of simple units
whicharecloselyrelatedtoeachother.Itstopological
structure includes input
layer, hidden layer and
outputlayer.HerewebuildaBPneuralnetworkwith