OPERATION MODE SELECTION DEVICE, OPERATION MODE SELECTION ASSISTANCE DEVICE, SHIP, OPERATION MODE SELECTION METHOD, AND PROGRAM
20250376252 ยท 2025-12-11
Assignee
Inventors
Cpc classification
B63J2003/043
PERFORMING OPERATIONS; TRANSPORTING
B63J3/04
PERFORMING OPERATIONS; TRANSPORTING
International classification
B63J3/04
PERFORMING OPERATIONS; TRANSPORTING
B63B79/20
PERFORMING OPERATIONS; TRANSPORTING
Abstract
This operation mode selection device selects an operation mode of a ship power system including one or a plurality of generators and batteries, the operation mode is determined through at least the number of generators in operation, the selection of a generator to be operated, a load sharing ratio of the generator and a battery, the selection of charging and discharging of the battery, and the device comprises a reinforcement learning unit which selects the operation mode; and performing reinforcement learning on a software agent.
Claims
1. An operation mode selection device that selects an operation mode of a ship power system including one or a plurality of generators and one or a plurality of batteries, in which the operation mode is determined by at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, the operation mode selection device comprising: a reinforcement learning unit that selects the operation mode by performing reinforcement learning on a software agent, using an output of a model of the ship power system that operates based on a prescribed setting pertaining to a load of the ship power system, as an environmental element, at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element.
2. The operation mode selection device according to claim 1, wherein the reinforcement learning unit performs the reinforcement learning on the software agent, by imposing a penalty in at least one of cases where a DC bus voltage of the ship power system becomes unstable to a degree greater than a predetermined degree, or where frequency of switching the operation mode is equal to or greater than a predetermined threshold value.
3. The operation mode selection device according to claim 1, wherein the setting includes information related to the load of the ship power system and information related to an ambient temperature.
4. The operation mode selection device according to claim 3, wherein the operation mode is further determined by selection of whether or not to supply power from shore, and the action element further includes selection of whether or not to supply power from shore.
5. An operation mode selection assistance device comprising: an operation mode selection assistance unit that assists in a selection operation of the operation mode based on the operation mode selected by the reinforcement learning unit according to claim 1.
6. A ship comprising: the operation mode selection assistance device according to claim 5; and the ship power system.
7. An operation mode selection method of selecting an operation mode of a ship power system including one or a plurality of generators and one or a plurality of batteries, in which the operation mode is determined by at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, the operation mode selection method comprising: a step of selecting the operation mode by performing reinforcement learning on a software agent, using an output of a model of the ship power system that operates based on a prescribed setting pertaining to a load of the ship power system, as an environmental element, at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element.
8. A non-transitory computer-readable recording medium storing a program for selecting an operation mode of a ship power system including one or a plurality of generators and one or a plurality of batteries, in which the operation mode is determined by at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, the program for causing a computer to execute: a step of selecting the operation mode by performing reinforcement learning on a software agent, using an output of a model of the ship power system that operates based on a prescribed setting pertaining to a load of the ship power system, as an environmental element, at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
DESCRIPTION OF EMBODIMENTS
[0029] Hereinafter, an operation mode selection device, an operation mode selection assistance device, a ship, an operation mode selection method, and a program according to an embodiment of the present disclosure will be described with reference to
(Configuration of Operation Mode Selection Device (1))
[0030]
(Configuration and Operation Mode of Ship Power System)
[0031] First, the ship power system 100 shown in
[0032] The AC-DC converters 110,111, and 112 convert AC power generated by the generators 101,102, and 103 into DC power and supply the DC power to the DC buses 107 or 108. The DC-DC converters 113 and 114 are connected to the DC bus 107 or 108, and control the charging and discharging power of batteries 104 or 105. The DC-AC converter 115 converts the DC power input from the DC bus 107 into AC power and drives a propulsion motor 120 of the ship. The DC-AC converter 116 converts the DC power input from the DC bus 107 into AC power and outputs the AC power to an AC load 125 via a transformer 123 or the like. The DC-AC converter 117 converts the DC power input from the DC bus 107 into AC power and drives a bow thruster motor 121 of the ship. The DC-AC converter 118 converts the DC power input from the DC bus 108 into AC power and drives a propulsion motor 122 of the ship. The DC-AC converter 119 converts the DC power input from the DC bus 108 into AC power and outputs the AC power to an AC load 126 via a transformer 124 or the like, or converts the AC power input from a shore power 127 via a switch 128, the transformer 124, or the like into the DC power and outputs the DC power to the DC bus 108.
[0033] In the present embodiment, the operation mode of the ship power system 100 is the operational state of the ship power system 100. The operation modes of the present embodiment include, for example, a mode in which all the generators 101 to 103 are operated, a mode in which some of the generators 101 to 103 are operated, and a mode in which none of the generators 101 to 103 are operated. In addition, the operation modes of the present embodiment include, for example, a mode in which both of the batteries 104 and 105 are discharged, a mode in which one of the batteries 104 and 105 is discharged, a mode in which both of the batteries 104 and 105 are charged, a mode in which one of the batteries 104 and 105 is charged, and a mode in which neither of the batteries 104 and 105 is charged or discharged. In addition, there are a plurality of operation modes by a combination of each mode of the generators 101 to 103 and each mode of the batteries 104 and 105. Further, the operation modes are made different by making the values of the generated power and the charging and discharging power equal or different for each of the generators 101 to 103 and the batteries 104 to 105.
[0034] Here, with reference to
[0035] The shore power mode shown in
[0036] The fully electric propulsion mode shown in
[0037] The hybrid mode shown in
[0038] In the present embodiment, the operation mode of the ship power system 100 is defined by at least the number of generators 101 to 103 in operation, the selection of the generators 101 to 103 to be operated, a load sharing ratio between the generators 101 to 103 and the batteries 104 and 105, and the selection of charging and discharging of the batteries 104 and 105. The operation mode of the ship power system 100 can be switched by controlling the operations of the AC-DC converters 110 to 112, the DC-DC converters 113 to 114, the DC-AC converters 115 to 119, and the switch 128, for example, through operation on a control panel (control panel 150 in
(Configuration of Operation Mode Selection Device (2))
[0039] The input and output unit 11 shown in
[0040] The reinforcement learning unit 12 selects an operation mode by performing reinforcement learning on a software agent, using an output of a model of the ship power system 100 that operates based on a prescribed setting pertaining to a load of the ship power system 100, as an environmental element, at least the number of generators 101 to 103 in operation, the selection of the generators 101 to 103 to be operated, a load sharing ratio between the generators 101 to 103 and the batteries 104 and 105, and the selection of charging and discharging of the batteries 104 and 105, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element. In the present embodiment, the prescribed setting pertaining to a load of the ship power system 100 is the load profile 132. The load profile 132 includes, for example, a time series of the load of the ship power system 100 and a time series of the ambient temperature.
[0041] The reinforcement learning unit 12 performs the reinforcement learning on the software agent, by imposing a penalty in at least one of case where the voltages of the DC buses 107 and 108 of the ship power system 100 become unstable to a degree greater than a predetermined degree, or where the frequency of switching the operation mode is equal to or greater than a predetermined threshold value.
[0042]
[0043] The software agent 12-1 includes a reinforcement learning processing unit 12-11 and a machine learning model 12-12. The machine learning model 12-12 is, for example, a machine learning model using a neural network, and inputs an environmental element observed by the software agent 12-1 and outputs the action element representing the action. The machine learning model 12-12 is machine-learned by the reinforcement learning processing unit 12-11 based on a predetermined reinforcement learning algorithm. The reinforcement learning algorithm is not limited, and any existing algorithm can be used. The reinforcement learning processing unit 12-11 inputs the environmental element that satisfies the constraint condition 133 and is observed by the software agent 12-1 to the machine learning model 12-12 based on the initial condition 134, and performs machine learning on the machine learning model 12-12 so that the action element that maximizes the reward is output. In the present embodiment, the action element is the element representing the action in reinforcement learning. The environmental element is the element observed in reinforcement learning. The reward element is the element representing the reward in reinforcement learning.
[0044] In the present embodiment, the action is the selection of an operation mode. The operation mode is represented by the number of generators in operation, the selection of generators to be operated, the load sharing ratio, and the selection of battery charging and discharging. In this case, the action elements are the number of generators in operation, the selection of generators to be operated, the load sharing ratio, and the selection of battery charging and discharging.
[0045] The operation mode may be further determined by selection of whether or not to supply power from the shore. In addition, the action elements may further include the selection of whether or not to supply power from the shore.
[0046] The reward is a lifecycle cost of the ship power system 100. The lifecycle cost is, for example, a total amount of a fuel cost required for a certain period such as a product lifetime, a design lifetime, or a planned usage period of the ship power system 100 and a cost other than the fuel cost such as a component replacement cost or an adjustment cost. The reward calculation unit 12-2 calculates the lifecycle cost based on the data indicating the operating status of the ship power system 100 output from the power system model 131 and the data indicating the characteristics of the lifetime of each device or component. In addition, for example, the reward calculation unit 12-2, by imposing a penalty in a case where the action that causes the DC bus voltage to become unstable is selected, such as when the supply power is lower than the load, or a case where the operation mode is frequently switched, notifies the software agent 12-1 that the operation mode determined to be a penalty is invalid, or adjusts the reward.
[0047]
[0048] In addition, the constraint condition 133 is, for example, a constraint that a generator is not operated and a motor is driven by a battery in order to reduce noise or the like in a port during sailing, as shown as a constraint condition in
[0049]
[0050] In the present embodiment, the power system model 131 is a simulation model that outputs environmental elements targeted for observation by the software agent 12-1. The power system model 131 is a model of the ship power system 100 that operates based on a prescribed setting (load profile 132) pertaining to the load of the ship power system 100 described above. The power system model 131 outputs the following elements (environmental elements) representing the operating status of the power system. That is, the power system model 131 outputs, for example, a battery SOC state, load sharing status, a device operating time, a DC grid voltage, an ambient temperature, a device temperature, device efficiency, DC grid power supply and demand status, operational status (in port, on standby, or the like), and the like as the environmental elements.
(Operation Example of Operation Mode Selection Device)
[0051]
[0052] Next, the reinforcement learning unit 12 executes reinforcement learning (S14) while advancing or resetting the time stamp in the load profile 132, until the learning completion condition is satisfied (S15: YES). When the learning completion condition is satisfied (S15: YES), the reinforcement learning unit 12 stores the operation mode selection result in the storage unit 13 as the operation mode selection result 136 in association with the load profile 132, stores the content of the reinforcement learning such as the reward in the storage unit 13 as the trained result 135 (S16), and ends the processing shown in
(Configuration of Operation Mode Selection Assistance Device)
[0053]
[0054] The load profile 231 and the operation mode selection result 232 are the same as the load profile 132 and the operation mode selection result 136 shown in
(Operation Example of Operation Mode Selection Assistance Device)
[0055]
Operations and Effects
[0056] In a DC microgrid for a ship, there are various operation modes, and it is possible to select parameters such as charging and discharging of a battery, and a load sharing ratio between a generator and a battery. However, there are many parameters to be considered, such as fuel efficiency of a generator engine, SOC and lifetime of the battery, a load condition, and sailing status (before docking and after departure), making optimal mode selection difficult. On the other hand, according to the present embodiment, the reinforcement learning unit 12 is provided that selects an operation mode by performing reinforcement learning on a software agent, using an output of the power system model 131 that operates based on the load profile 132 of the ship power system 100, as an environmental element, at least the number of generators in operation, the selection of the generators to be operated, a load sharing ratio between the generators and the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element. Therefore, it is possible to appropriately select an operation mode in the power system of the ship.
OTHER EMBODIMENTS
[0057] Hereinabove, the embodiment of the present disclosure has been described in detail with reference to the drawings, but the specific configuration is not limited to the embodiment, and includes design changes and the like within a scope not departing from the gist of the present disclosure.
<Configuration of Computer>
[0058]
[0059] A computer 90 includes a processor 91, a main memory 92, a storage 93, and an interface 94.
[0060] The operation mode selection device 1 and the operation mode selection assistance device 2 described above are mounted on the computer 90. The operation of each processing unit described above is stored in the storage 93 in the form of a program. The processor 91 reads the program from the storage 93, develops the program in the main memory 92, and executes the above-described processing according to the program. In addition, the processor 91 secures a storage area corresponding to each storage unit described above in the main memory 92 according to the program.
[0061] The program may be for realizing some of the functions to be exhibited by the computer 90. For example, the program may exhibit a function in combination with another program already stored in a storage or in combination with another program implemented in another device. In another embodiment, the computer may include a custom large scale integrated (LSI) circuit such as a programmable logic device (PLD) in addition to or instead of the above configuration. Examples of the PLD include a programmable array logic (PAL), a generic array logic (GAL), a complex programmable logic device (CPLD), and a field programmable gate array (FPGA). In this case, some or all of the functions realized by the processor may be realized by the integrated circuit.
[0062] Examples of the storage 93 include a hard disk drive (HDD), a solid state drive (SSD), a magnetic disk, a magneto-optical disk, a compact disc read only memory (CD-ROM), a digital versatile disc read only memory (DVD-ROM), and a semiconductor memory. The storage 93 may be an internal medium directly connected to a bus of the computer 90, or may be an external medium connected to the computer 90 via the interface 94 or a communication line. In addition, when this program is distributed to the computer 90 via the communication line, the computer 90 that has received the distribution may develop the program in the main memory 92, and may execute the above-described processing. In at least one embodiment, the storage 93 is a non-transitory tangible storage medium.
ADDITIONAL NOTES
[0063] The operation mode selection device 1, the operation mode selection assistance device 2, the ship 300, the operation mode selection method, and the program according to each embodiment are understood as follows, for example. [0064] (1) An operation mode selection device 1 according to a first aspect is an operation mode selection device that selects an operation mode of a ship power system 100 including one or a plurality of generators 101 to 103 and one or a plurality of batteries 104 and 105, in which the operation mode is determined by at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, the operation mode selection device including: a reinforcement learning unit 12 that selects the operation mode by performing reinforcement learning on a software agent 12-1 using an output of a model of the ship power system (power system model 131) that operates based on a prescribed setting (load profile 132) pertaining to a load of the ship power system, as an environmental element, at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime; as a reward element. According to the present aspect and each of the following aspects, the operation mode of the power system of the ship can be appropriately selected. [0065] (2) An operation mode selection device 1 according to a second aspect is the operation mode selection device according to (1), in which the reinforcement learning unit 12 performs the reinforcement learning on the software agent, by imposing a penalty in at least one of cases where a DC bus voltage of the ship power system becomes unstable to a degree greater than a predetermined degree, or where frequency of switching the operation mode is equal to or greater than a predetermined threshold value. [0066] (3) An operation mode selection device 1 according to a third aspect is the operation mode selection device according to (1) or (2), in which the setting (load profile 132) includes information related to a load of the ship power system and information related to an ambient temperature. [0067] (4) An operation mode selection device 1 according to a fourth aspect is the operation mode selection device according to any one of (1) to (3), in which the operation mode is further determined by selection of whether or not to supply power from shore, and the action element further includes selection of whether or not to supply power from shore. [0068] (5) An operation mode selection assistance device 2 according to a fifth aspect includes an operation mode selection assistance unit 22 that assists in a selection operation of the operation mode based on the operation mode selected by the reinforcement learning unit 12 according to (1). [0069] (6) A ship 300 according to a sixth aspect includes the operation mode selection assistance device 2 according to (5), and the ship power system 100. [0070] (7) An operation mode selection method according to a seventh aspect is an operation mode selection method of selecting an operation mode of a ship power system including one or a plurality of generators and one or a plurality of batteries, in which the operation mode is determined by at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, the operation mode selection method including: a step of selecting the operation mode by performing reinforcement learning on a software agent, using an output of a model of the ship power system that operates based on a prescribed setting pertaining to a load of the ship power system, as an environmental element, at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element. [0071] (8) A program according to an eighth aspect is a program for selecting an operation mode of a ship power system including one or a plurality of generators and one or a plurality of batteries, in which the operation mode is determined by at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, the program for causing a computer to execute: a step of selecting the operation mode by performing reinforcement learning on a software agent, using an output of a model of the ship power system that operates based on a prescribed setting pertaining to a load of the ship power system, as an environmental element, at least the number of the generators in operation, selection of the generators to be operated, a load sharing ratio between the generators and the batteries, and selection of charging and discharging of the batteries, as an action element, and a fuel cost and a lifecycle cost pertaining to a component lifetime, as a reward element.
INDUSTRIAL APPLICABILITY
[0072] According to the operation mode selection device, the operation mode selection assistance device, the ship, the operation mode selection method, and the program of the present disclosure, it is possible to appropriately select the operation mode in the power system of the ship.
REFERENCE SIGNS LIST
[0073] 1: operation mode selection device [0074] 2: operation mode selection assistance device [0075] 12: reinforcement learning unit [0076] 12-1: software agent [0077] 22: operation mode selection assistance unit [0078] 131: power system model [0079] 132, 231: load profile [0080] 133, 232: operation mode selection result [0081] 100: ship power system [0082] 101 to 103: generator [0083] 104 and 105: battery [0084] 300: ship