METHOD AND APPARATUS FOR CONTROLLING A COMMUNICATIVELY ISOLATED WATERCRAFT

Abstract

A method of training a machine learning, ML, algorithm to control a watercraft is described. The watercraft is a submarine or a submersible submerged in water. The method is implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft. The method comprises: obtaining training data including respective sets of environmental parameters and corresponding actions of a set of communicatively isolated watercraft, including a first watercraft; and training the ML algorithm comprising determining relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof. A method of controlling a watercraft by a trained ML algorithm is also described.

Claims

1. A method of training a machine learning, ML, algorithm to control a watercraft, wherein the watercraft is a submarine or a submersible submerged in water, the method implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft, the method comprising: obtaining training data including respective sets of environmental parameters and corresponding actions of a set of communicatively isolated watercraft, including a first watercraft; and training the ML algorithm comprising determining relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof.

2. The method according to claim 1, wherein determining the relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof comprises detecting boundaries related to the environmental parameters and relating the corresponding actions to the detected boundaries.

3. The method according to claim 1, wherein obtaining the corresponding actions of the first watercraft comprises identifying actions performed by a human operator aboard the first watercraft.

4. The method according to claim 1, wherein obtaining the corresponding actions of the first watercraft comprises identifying remedial actions performed by a human operator aboard the first watercraft responsive to actions implemented by the ML algorithm.

5. The method according to claim 1, wherein the actions are selected from controlling: a buoyancy, a rudder, a control surface or plane, a thruster, a propeller, a propulsor, and/or a prime mover, of the watercraft.

6. The method according to claim 1, wherein the sets of environmental parameters include one or more sensor signals, the one or more sensor signals related to a pressure, a temperature, a salinity, a density, a tide, a current, a relative velocity, and/or a seabed of the water.

7. The method according to claim 1: wherein the training data include respective policies and corresponding trajectories of the set of watercraft, wherein each policy relates to navigating a watercraft of the set thereof in the water towards a target and wherein each corresponding trajectory comprises a series of states in a state space of the watercraft; and wherein training the ML algorithm comprising determining relationships between the respective policies and corresponding trajectories of the watercraft of the set thereof based on respective results of comparing the trajectories and the targets.

8. The method according to claim 7, wherein the ML algorithm comprises and/or is a reinforcement learning, RL, agent, and wherein training the ML algorithm comprises training the agent, the training comprising: (a) actioning, by the agent, a watercraft of the set thereof according to a respective policy, wherein the policy is of an action space of the agent, comprising navigating the watercraft of the set thereof towards a target, thereby defining a corresponding trajectory comprising a series of states in a state space of the watercraft and thereby obtaining respective training data; (b) determining a relationship between the policy and the trajectory based on a result of comparing the trajectory and the target and updating the policy based on the result; and (c) repeating (a) and (b) for the set of watercraft, using the updated policy.

9. A method of controlling a communicatively isolated watercraft, wherein the watercraft is a submarine or a submersible submerged in water, the method implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft, the method comprising: controlling, by a trained machine learning, ML, algorithm, the watercraft, the controlling comprising navigating the watercraft towards a target.

10. The method according to claim 9, wherein navigating the watercraft towards the target is according to a policy.

11. The method according to claim 9, comprising obtaining a set of environmental parameters.

12. The method according to claim 9, wherein the watercraft is an autonomous and/or unmanned watercraft.

13. The method according to claim 9, wherein navigating the watercraft towards the target comprises navigating the watercraft via one or more boundaries related to environmental parameters.

14. (canceled)

15. (canceled)

16. The method according to claim 5, wherein the control surface or plane includes a bow plane, a sail plane, or a stern plane.

17. A non-transient processor-readable medium encoded with instructions that when executed by one or more processors cause a process to be carried out for controlling a communicatively isolated watercraft, wherein the watercraft is a submarine or a submersible submerged in water, the process comprising: controlling, by a trained machine learning, ML, algorithm, the watercraft, the controlling comprising navigating the watercraft towards a target.

18. The non-transient processor-readable medium according to claim 17, wherein navigating the watercraft towards the target is according to a policy.

19. The non-transient processor-readable medium according to claim 17, comprising obtaining a set of environmental parameters.

20. The non-transient processor-readable medium according to claim 17, wherein the watercraft is an autonomous and/or unmanned watercraft.

21. The non-transient processor-readable medium according to claim 17, wherein navigating the watercraft towards the target comprises navigating the watercraft via one or more boundaries related to environmental parameters.

22. A communicatively isolated watercraft comprising the non-transient processor-readable medium according to claim 17.

Description

BRIEF DESCRIPTION OF THE FIGURES

[0050] Embodiments of the invention will now be described by way of example only with reference to the figures, in which:

[0051] FIG. 1 shows a method according to an exemplary embodiment;

[0052] FIG. 2 shows a method according to an exemplary embodiment;

[0053] FIG. 3 shows a method according to an exemplary embodiment;

[0054] FIG. 4 shows a method according to an exemplary embodiment; and

[0055] FIG. 5 shows a method according to an exemplary embodiment.

DETAILED DESCRIPTION

[0056] FIG. 1 shows a method 100 according to an exemplary embodiment. The method 100 is of training a machine learning, ML, algorithm to control a watercraft. The watercraft is a submarine or a submersible submerged in water. The method is implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft.

[0057] At 102, the method comprises obtaining training data including respective sets of environmental parameters and corresponding actions of a set of communicatively isolated watercraft, including a first watercraft.

[0058] At 104, the method comprises training the ML algorithm comprising determining relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof.

[0059] The method 100 may include any of the steps described with respect to the first aspect.

[0060] FIG. 2 shows a method 200 according to an exemplary embodiment. The method 200 is of controlling a communicatively isolated watercraft. The watercraft is a submarine or a submersible submerged in water. The method is implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft.

[0061] At 202, the method comprises controlling, by a trained machine learning, ML, algorithm, the watercraft, comprising navigating the watercraft towards a target.

[0062] The method 200 may include any of the steps described with respect to the second aspect.

[0063] FIG. 3 shows a method according to an exemplary embodiment. Particularly, FIG. 3 shows a side elevation view of a submarine 300 diving with a velocity V in water W (salt water), having a surface S and a thermocline T. The thermocline T (i.e. a boundary) is detected, based on temperature sensor signals (i.e. environmental parameters). A density of the water W below the thermocline T is relatively higher, since a temperature of the water W therebelow is relatively lower, compared with the water W in the epipelagic zone. To maintain the velocity V of the submarine 300 upon diving through the thermocline T (i.e. including maintaining a constant rate of descent), corresponding actions are actioned: a level in the ballast tank 302 of the submarine 300 is adjusted and optionally, an inclination of the bow plane 304 adjusted. The environmental parameters and the corresponding actions may be used for training the ML algorithm. Alternatively, the trained ML algorithm may implement these actions responsive to detecting the thermocline T.

[0064] FIG. 4 shows a method according to an exemplary embodiment. Particularly, FIG. 4 shows a side elevation view of a submarine 400 moving with a velocity V in salt water SW towards a fresh water FW pocket for example near an estuary, having a surface S. A boundary B between the salt water SW and the fresh water FW is detected, based on salinity sensor signals (i.e. environmental parameters). A density of the fresh water FW is relatively lower, compared with the salt water FW. To maintain the velocity V of the submarine 400 upon moving through the boundary B (i.e. including maintaining a constant depth), corresponding actions are actioned: a level in the ballast tank 402 of the submarine 400 is adjusted and optionally, an inclination of the bow plane 404 adjusted. The environmental parameters and the corresponding actions may be used for training the ML algorithm. Alternatively, the trained ML algorithm may implement these actions responsive to detecting the boundary B.

[0065] FIG. 5 shows a method according to an exemplary embodiment. Particularly, FIG. 5 shows a plan view of a submarine 500 moving with a velocity V in water W (salt water). A boundary B between currents C1 and C2 is detected, based on relative velocity sensor signals (i.e. environmental parameters). A velocity of the water W due to the current C1 is different to the velocity of the water W due to the current C2. To maintain the velocity V of the submarine 500 upon moving through the boundary B (i.e. including maintaining a constant bearing), corresponding actions are actioned: an inclination of the rudder plane 506 adjusted. The environmental parameters and the corresponding actions may be used for training the ML algorithm. Alternatively, the trained ML algorithm may implement these actions responsive to detecting the boundary B.

METHOD AND APPARATUS FOR CONTROLLING A COMMUNICATIVELY ISOLATED WATERCRAFT

Assignee

Inventors

Cpc classification

Classification Explorer

G05D2107/25

PHYSICS

Classification Explorer

B63G8/001

PERFORMING OPERATIONS; TRANSPORTING

Classification Explorer

G05D2101/15

PHYSICS

Classification Explorer

G05D2109/38

PHYSICS

Classification Explorer

G06N3/08

PHYSICS

Classification Explorer

G05D1/606

PHYSICS

Classification Explorer

G06N3/04

PHYSICS

Classification Explorer

B63G2008/004

PERFORMING OPERATIONS; TRANSPORTING

Classification Explorer

G05D1/0206

PHYSICS

International classification

Classification Explorer

G05D1/606

PHYSICS

Classification Explorer

B63G8/00

PERFORMING OPERATIONS; TRANSPORTING

Abstract

Claims

Description