METHOD AND APPARATUS FOR CONTROLLING A COMMUNICATIVELY ISOLATED WATERCRAFT
20240272644 ยท 2024-08-15
Assignee
Inventors
- Lawrence Edward Clabburn (Barrow-in-Furness Cumbria, GB)
- Simon Phillip Newby (Barrow-in-Furness Cumbria, GB)
- David Charles Alexander Ritchie (Barrow-in-Furness Cumbria, GB)
Cpc classification
B63G8/001
PERFORMING OPERATIONS; TRANSPORTING
G05D1/606
PHYSICS
International classification
Abstract
A method of training a machine learning, ML, algorithm to control a watercraft is described. The watercraft is a submarine or a submersible submerged in water. The method is implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft. The method comprises: obtaining training data including respective sets of environmental parameters and corresponding actions of a set of communicatively isolated watercraft, including a first watercraft; and training the ML algorithm comprising determining relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof. A method of controlling a watercraft by a trained ML algorithm is also described.
Claims
1. A method of training a machine learning, ML, algorithm to control a watercraft, wherein the watercraft is a submarine or a submersible submerged in water, the method implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft, the method comprising: obtaining training data including respective sets of environmental parameters and corresponding actions of a set of communicatively isolated watercraft, including a first watercraft; and training the ML algorithm comprising determining relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof.
2. The method according to claim 1, wherein determining the relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof comprises detecting boundaries related to the environmental parameters and relating the corresponding actions to the detected boundaries.
3. The method according to claim 1, wherein obtaining the corresponding actions of the first watercraft comprises identifying actions performed by a human operator aboard the first watercraft.
4. The method according to claim 1, wherein obtaining the corresponding actions of the first watercraft comprises identifying remedial actions performed by a human operator aboard the first watercraft responsive to actions implemented by the ML algorithm.
5. The method according to claim 1, wherein the actions are selected from controlling: a buoyancy, a rudder, a control surface or plane, a thruster, a propeller, a propulsor, and/or a prime mover, of the watercraft.
6. The method according to claim 1, wherein the sets of environmental parameters include one or more sensor signals, the one or more sensor signals related to a pressure, a temperature, a salinity, a density, a tide, a current, a relative velocity, and/or a seabed of the water.
7. The method according to claim 1: wherein the training data include respective policies and corresponding trajectories of the set of watercraft, wherein each policy relates to navigating a watercraft of the set thereof in the water towards a target and wherein each corresponding trajectory comprises a series of states in a state space of the watercraft; and wherein training the ML algorithm comprising determining relationships between the respective policies and corresponding trajectories of the watercraft of the set thereof based on respective results of comparing the trajectories and the targets.
8. The method according to claim 7, wherein the ML algorithm comprises and/or is a reinforcement learning, RL, agent, and wherein training the ML algorithm comprises training the agent, the training comprising: (a) actioning, by the agent, a watercraft of the set thereof according to a respective policy, wherein the policy is of an action space of the agent, comprising navigating the watercraft of the set thereof towards a target, thereby defining a corresponding trajectory comprising a series of states in a state space of the watercraft and thereby obtaining respective training data; (b) determining a relationship between the policy and the trajectory based on a result of comparing the trajectory and the target and updating the policy based on the result; and (c) repeating (a) and (b) for the set of watercraft, using the updated policy.
9. A method of controlling a communicatively isolated watercraft, wherein the watercraft is a submarine or a submersible submerged in water, the method implemented, at least in part, by a computer, comprising a processor and a memory, aboard the watercraft, the method comprising: controlling, by a trained machine learning, ML, algorithm, the watercraft, the controlling comprising navigating the watercraft towards a target.
10. The method according to claim 9, wherein navigating the watercraft towards the target is according to a policy.
11. The method according to claim 9, comprising obtaining a set of environmental parameters.
12. The method according to claim 9, wherein the watercraft is an autonomous and/or unmanned watercraft.
13. The method according to claim 9, wherein navigating the watercraft towards the target comprises navigating the watercraft via one or more boundaries related to environmental parameters.
14. (canceled)
15. (canceled)
16. The method according to claim 5, wherein the control surface or plane includes a bow plane, a sail plane, or a stern plane.
17. A non-transient processor-readable medium encoded with instructions that when executed by one or more processors cause a process to be carried out for controlling a communicatively isolated watercraft, wherein the watercraft is a submarine or a submersible submerged in water, the process comprising: controlling, by a trained machine learning, ML, algorithm, the watercraft, the controlling comprising navigating the watercraft towards a target.
18. The non-transient processor-readable medium according to claim 17, wherein navigating the watercraft towards the target is according to a policy.
19. The non-transient processor-readable medium according to claim 17, comprising obtaining a set of environmental parameters.
20. The non-transient processor-readable medium according to claim 17, wherein the watercraft is an autonomous and/or unmanned watercraft.
21. The non-transient processor-readable medium according to claim 17, wherein navigating the watercraft towards the target comprises navigating the watercraft via one or more boundaries related to environmental parameters.
22. A communicatively isolated watercraft comprising the non-transient processor-readable medium according to claim 17.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0050] Embodiments of the invention will now be described by way of example only with reference to the figures, in which:
[0051]
[0052]
[0053]
[0054]
[0055]
DETAILED DESCRIPTION
[0056]
[0057] At 102, the method comprises obtaining training data including respective sets of environmental parameters and corresponding actions of a set of communicatively isolated watercraft, including a first watercraft.
[0058] At 104, the method comprises training the ML algorithm comprising determining relationships between the respective sets of environmental parameters and the corresponding actions of the watercraft of the set thereof.
[0059] The method 100 may include any of the steps described with respect to the first aspect.
[0060]
[0061] At 202, the method comprises controlling, by a trained machine learning, ML, algorithm, the watercraft, comprising navigating the watercraft towards a target.
[0062] The method 200 may include any of the steps described with respect to the second aspect.
[0063]
[0064]
[0065]