SIMULATION DEVICE, SIMULATION METHOD, AND MEMORY MEDIUM
20170255720 · 2017-09-07
Assignee
Inventors
Cpc classification
G06F17/16
PHYSICS
G06F17/18
PHYSICS
International classification
G06F17/18
PHYSICS
G06F17/16
PHYSICS
Abstract
A simulation device includes a system model, a data selection processing unit, a plurality of observation models, a post-distribution creating unit, a post-distribution unifying unit, and a determining unit. The system model calculates a time evolution of a state vector. The data selection processing unit selects multiple items of observation data. The observation model converts the state vector from the system model on the basis of the relationship with the observation data. The post-distribution creating unit creates, on the basis of the state vector from the observation model and the selected observation data, a first post-distribution based on all pieces of the observation data or a second post-distribution based on absent observation data. The post-distribution unifying unit unifies the first and second post-distributions. The determining unit determines which of the second post-distribution or the unified post-distribution is to be used.
Claims
1. A simulation device, comprising: a memory that stores a set of instructions; and at least one processor configured to execute the set of instructions to: obtain an initial state of a state vector and a parameter in a simulation and a plurality of pieces of observation data as input; operate as a system model that, based on the initial state and the parameter, simulates a time evolution of the state vector; select, based on information relating to the state vector in the system model, from the plurality of pieces of observation data, a plurality of pieces of observation data to be used; operate as plurality of observation models, each being associated with one of the selected plurality of pieces of observation data, each of which transforms and outputs a state vector output from the system model based on a relationship between the observation data and the state vector; create, based on state vectors output from the plurality of observation models and pieces of observation data selected posterior distributions of the state vector, outputting a posterior distribution based on all pieces of observation data selected as a first posterior distribution, and output a posterior distribution based on a set of observation data lacking one or more pieces of observation data as a second posterior distribution; perform unification of the first posterior distribution and the second posterior distribution; determine which one of the second posterior distribution and a posterior distribution after the unification is to be used; and output, in addition to inputting a state vector including a posterior distribution determined and the first posterior distribution to the system model, a time series of the state vector.
2. The simulation device according to claim 1, wherein the at least one processor is configured to: create, by comparing pieces of information that relate to a state vector set in the system model with the pieces of observation data, the observation models related to the pieces of observation data.
3. The simulation device according to claim 1, wherein the at least one processor is configured to: set noise amounts of the observation models related to the pieces of observation data.
4. The simulation device according to claim 1, wherein the at least one processor is configured to: use, in processing of unifying the first posterior distribution and the second posterior distribution, a model created based on a correlation between posterior distributions that are already calculated.
5. The simulation device according to claim 4, wherein the at least one processor is configured to: apply Bayesian updating to a parameter characterizing an arithmetic operation of the model to be used in the unification based on a correlation between posterior distributions that are already calculated.
6. The simulation device according to claim 1, wherein the at least one processor is configured to: determine, based on variance values of the second posterior distribution and a posterior distribution after the unification, which one of the second posterior distribution and the posterior distribution after the unification is to be used.
7. The simulation device according to claim 1, wherein the state vector includes state variables each of which related to one of grid points discretized in a domain over which simulation is performed, and the observation models relate the grid points related to the state variables with a degree of resolution of an observation point of one of the plurality of pieces of observation data for each of the pieces of observation data.
8. The simulation device according to claim 1, wherein a probability distribution of each of the state variables is approximated by a set of ensembles that are discretized and calculated independently of one another, and the at least one processor is configured to: perform unification by superposing probability distributions of the state variables at a predetermined ratio, the probability distributions being approximated by the sets of ensembles.
9. A simulation method, the method comprising: when an initial state of a state vector and a parameter in a simulation and a plurality of pieces of observation data are input, simulating a time evolution of the state vector using a system model based on the initial state and the parameter; selecting, from the plurality of pieces of observation data, a plurality of pieces of observation data to be used based on information related to the state vector in the system model; transforming, by use of a plurality of observation models each of which is associated with one of the selected plurality of pieces of observation data, the state vector output from the system model based on a relationship between the piece of observation data and the state vector; creating posterior distributions of the state vector based on state vectors output from the plurality of observation models and the selected pieces of observation data; performing unification of a first posterior distribution based on all the selected pieces of observation data and a second posterior distribution based on a set of observation data lacking one or more pieces of observation data; determining which one of the second posterior distribution and a posterior distribution after the unification is to be used; inputting a state vector including a determined posterior distribution and the first posterior distribution to the system model; and outputting a time series of a state vector including a determined posterior distribution and the first posterior distribution.
10. A non-transitory computer-readable storage medium storing a computer program, the program making a computer device execute: input processing of obtaining an initial state of a state vector and a parameter in a simulation and a plurality of pieces of observation data as input; system model calculation processing of, based on the initial state and the parameter, simulating a time evolution of the state vector using a system model; data selection processing of, based on information relating to the state vector in the system model, selecting, from the plurality of pieces of observation data, a plurality of pieces of observation data to be used; observation model calculation processing of, by use of a plurality of observation models each of which is associated with one of the selected plurality of pieces of observation data, transforming and outputting each state vector output from the system model based on a relationship between the piece of observation data and the state vector; posterior distribution creating processing of, based on state vectors output from the plurality of observation models and pieces of observation data selected in the data selection processing, creating posterior distributions of the state vector, outputting a posterior distribution based on all pieces of observation data selected in the data selection processing as a first posterior distribution, and outputting a posterior distribution based on a set of observation data lacking one or more pieces of observation data as a second posterior distribution; posterior distribution unifying processing of performing unification of the first posterior distribution and the second posterior distribution; determining processing of determining which one of the second posterior distribution and a posterior distribution after the unification is to be used; and output processing of inputting a state vector including a posterior distribution determined in the determining processing and the first posterior distribution to the system model and outputting a time series of the state vector.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
DESCRIPTION OF EMBODIMENTS
[0038] Hereinafter, example embodiments of the present invention will be described with reference to drawings.
First Example Embodiment
[0039] A simulation device 100 as a first example embodiment of the present invention will be described. The simulation device 100 is applicable to simulation that solves a continuous time-space partial differential equation formulated according to physical laws and follows a time evolution. Such partial differential equations include, for example, an equation of motion that describes motion, Navier-Stokes equations that describe fluid, a thermodynamic equation that describes thermal change, and a shallow-water wave equation that describes tsunamis. The simulation device 100 is also applicable to simulation using a finite element method. In the present example embodiment, it is assumed that a system subject to simulation is a system in which a state vector the temporal change of which is followed is linked with actual observation data by means of any relational expression, that is, a system that allows comparison between simulation results and observation data.
[0040] First, a configuration of the simulation device 100 is illustrated in
[0041] An example of a hardware configuration of the simulation device 100 is illustrated in
[0042] Next, details of each of the functional blocks of the simulation device 100 will be described.
[0043] First, the input unit 10 will be described. The input unit 10 obtains an initial state of a state vector and parameters in an observation domain over which simulation is to be performed and M types of observation data (first to M-th sets of observation data). Each of the M types of observation data are observation values from a sensor and the like. Each of the M types of observation data may have a different number of dimensions from or the same number of dimensions as that/those of a part or all of the other observation data. The input unit 10 may, for example, obtain the above-described information stored in the storage device 1004. The input unit 10 may also obtain the above-described information by obtaining storage position information thereof in the storage device 1004 via the input device 1005.
[0044] Next, the system model 21 will be described. The system model 21 simulates time evolutions of the state vector on the basis of the initial state and parameters obtained by the input unit 10. While the time evolution of an actual phenomenon subject to simulation is expressed by a continuous time-space partial differential equation, in order to perform simulation, a domain over which the simulation is performed needs to be discretized in time and space. The simulation device 100 uses a state vector, which is generated from a combination of state variables, to follow time evolutions of an actual phenomenon in an observation domain. The number of state variables may be defined in accordance with the purpose of the simulation and be set to any number. In the present example embodiment, a description will be made mainly on an example in which the number of state variables is two. In this case, two state variables U and V are also denoted, using a vector ξ, by ξ=(U, V).
[0045] The discretization in time is achieved by advancing a step from a state variable ξt at a time t and calculating a state variable ξt+1 at a time t+1. In the following description, a time indicates a step in a simulation, and, for example, a time t−1 means the step one step before a time t. Hereinafter, a step in a simulation is also referred to as a time step.
[0046] The discretization in space is achieved by assuming that a two-dimensional space is divided into a grid shape and denoting a state variable that is defined at a k-th grid point counting from a reference point and at a time t by (δ.sub.k)t. In the denotation, ξ.sub.k is denoted as ξ.sub.k=(U.sub.k, V.sub.k). In other words, a state vector is generated from state variables at respective grid points discretized within a domain over which a simulation is performed. When it is assumed that the last grid point number among the grid point numbers representing a domain over which a simulation is performed is denoted by L, a combination of state variables at a time t is denoted by a state vector X.sub.t, which is expressed by the expression (1) below:
X.sub.t=[ξ.sub.1, . . . , ξ.sub.k, . . . , ξ.sub.L].sub.t.sup.T=[(U.sub.1, V.sub.1), . . . , (U.sub.k, V.sub.k), . . . , (U.sub.L, V.sub.L)].sub.t.sup.T (1).
The sign T in the expression denotes transposition. The number of dimensions of the state vector is calculated as the product of the number of state variables per grid point and the number L of grid points. In the case of the expression (1), the state vector is a 2L dimensional vector.
[0047] The system model 21 performs an updating operation of a state vector X.sub.t−1 at a time t−1 to a state vector X.sub.t at a time t in the discretized time and space. When it is assumed that a mapping representing the updating operation is denoted by f, the system model 21 is described by a relational expression expressed by the expression (2) below:
X.sub.t=f(X.sub.t−1, θ, v.sub.t) (2).
Here, θ denotes a parameter vector including various parameters required for calculation in the model. In addition, V.sub.t denotes a system noise at the time t. The system noise V.sub.t is introduced, in order to numerically express an effect of incompleteness in the model, as a stochastic driving term that has an effect on the state vector. The mapping f may be linear or non-linear depending on a target phenomenon. As is obvious from the expression (2), the state vector X.sub.t at the time t does not have to be defined explicitly with the state vector X.sub.t−1 at the time t−1. That is, the system model 21 of the present example embodiment may calculate the state vector X.sub.t at the time t using the state vector X.sub.t−1 at the time t−1 as input.
[0048] The updating operation, by the system model 21, from a state vector at a time t−1 to a state vector at a time t will be described in detail below. First, ensemble approximation for coping with a simulation having incompleteness will be described. Hereinafter, in reflection of incompleteness of the mapping f and incompleteness in the parameter θ to be input, the state vector X.sub.t and the system noise V.sub.t in the system model 21 are treated as, instead of definite values X.sub.t and V.sub.t, probability distributions p(X.sub.t) and p(V.sub.t), respectively. Approximating the probability distributions p(X.sub.t) and p(V.sub.t) by sets of N ensembles i:
{X.sub.t−1.sup.(i)}.sub.1=1.sup.N {V.sub.t−1.sup.(i)}.sub.1=1.sup.N (3),
respectively, is referred to as ensemble approximation. Therefore, the system model 21 in an actual simulation calculates a time evolution of each ensemble i:
X.sub.t.sup.(i)=f(X.sub.t−1.sup.(i), θ, v.sub.t.sup.(i)) (4)
for all ensembles. From this calculation, the probability distribution p(X.sub.t) of the state vector X.sub.t at the time t is approximated by N ensembles:
{X.sub.t.sup.(i)}.sub.1=1.sup.N (5).
The ensemble calculation expressed by the expression (4) is characterized by being independent with respect to each ensemble. Therefore, the system model 21 may not only repeat calculation N times but also perform calculation once using N parallel processors, and may change calculation methods flexibly depending on calculation resources. Hereinafter, the probability distribution of the state vector X.sub.t is denoted by p(X.sub.t) and also referred to as a prior distribution.
[0049] As described above, using the system model 21 independently for each ensemble enables the probability distribution p(X.sub.t) of the state vector at the time t to be calculated from the probability distribution p(X.sub.t−1) of the state vector at the time t−1. The system model 21 outputs the calculated probability distribution p(X.sub.t), as a prior distribution, to the m observation models 31, which will be described later. The system model 21 may, for example, store the calculated prior distribution p(X.sub.t) in the prior distribution storage unit 22 or the like, which is readable by the m observation models 31.
[0050] Next, the data selection processing unit 30 and the observation models 31 will be described. The data selection processing unit 30 selects m types of plural pieces of observation data to be used out of the first to M-th sets of observation data on the basis of information relating to the state vector. The data selection processing unit 30 outputs the selected m types of observation data to the posterior distribution creating unit 40, which will be described later.
[0051] In the present example embodiment, it is assumed that information relating to the state vector is input from the system model 21 to the data selection processing unit 30 as a control signal CTL0. The control signal CTL0 may include, for example, the number of dimensions of the state vector X.sub.t and other information relating to the state variables. On the basis of the information included in the control signal CTL0, the data selection processing unit 30 selects m types of observation data OBS.sub.1 to OBS.sub.m at a time t, which are to be used. The data selection processing unit 30 outputs the selected sets of observation data OBS.sub.1 to OBS.sub.m to the posterior distribution creating unit 40.
[0052] The data selection processing unit 30 may create m observation models 31 each of which corresponds to one of the selected m types of observation data by comparing information on the state vector set by the system model 21 with the respective types of observation data (for example, physical quantities and dimensions). The data selection processing unit 30 may, for example, create the m observation models 31 using m control signals CTL1 to CTLm for relating the state vector X.sub.t with the sets of observation data OBS.sub.1 to OBS.sub.m.
[0053] The creation of the observation models 31 described above is to create an observation model equation that relates the state vector X.sub.t with the sets of observation data OBS.sub.1 to OBS.sub.m. Such relations between the state vector and the sets of observation data are illustrated schematically in
In this case, the data selection processing unit 30 may output information on mappings h.sub.1 to h.sub.m, and noise amounts w.sub.1 to w.sub.m, which are to be taken into consideration in the respective sets of observation data, in the expression (6) to the observation models 31-1 to 31-m as the control signals CTL1 to CTLm, respectively. From this processing, the m observation models 31 are created.
[0054] If it is assumed that the sets of observation data are ideally obtained at all grid points 1 to L, each of the noise amounts w.sub.1 to w.sub.m, and the sets of observation data OBS.sub.1 to OBS.sub.m in the expression (6) is an L-dimensional column vector. On the basis of a variance value and noise amount of each set of observation data, the data selection processing unit 30 may set the noise amount in the observation models 31 related to the set of observation data.
[0055] In the expression (6), E.sub.1 to E.sub.m are matrices that associate the grid points 1 to L of the system model 21 with resolutions of observation points at which the sets of observation data are actually obtained. For example, when it is assumed that the number of variables in a state variable ξ.sub.k=(U.sub.k, V.sub.k) at each grid point k in the state vector X.sub.t is two, each of the matrices E.sub.1 to E.sub.m is an at most 2L×2L dimensional matrix.
[0056] In general, in a state variable ξ.sub.k=(U.sub.k, V.sub.k) at a grid point k, U.sub.k and V.sub.k are physical quantities that differ from each other. Thus, relations between U.sub.k and observation data and between V.sub.k and the observation data are not able to be defined by mappings h.sub.1 to h.sub.m of an identical observation model equation. The following description will thus be made using a configuration for U.sub.k (k=1 to L) in the state variables as an example. In this case, each of the above-described matrices E.sub.1 to E.sub.m is a 2L×L dimensional matrix. For example, it is assumed that the set of observation data OBS.sub.1 is obtained with respect to all the grid points 1 to L of the system model 21. In this case, E.sub.1 is a 2L×L dimensional matrix and takes the form of a matrix expressed by the expression (7) below:
In the matrix, only the element at the j-th (j is an integer not smaller than 1 and not larger than L) row and {1+2(j−1)}th column has a value of 1.
[0057] A case in which no data is observed at some grid points in the grid points 1 to L, as the set of observation data OBS.sub.2 illustrated in
H.sub.j(X.sub.t, w.sub.j)≡h.sub.jE.sub.jX.sub.t+w.sub.j (8).
Thus, all the m types of observation models 31 individually perform the arithmetic operation expressed by the expression (8) on the state vector X.sub.t, and output all the m types of transformed state vectors X.sub.t to the posterior distribution creating unit 40. A combination of the expression (2) or the expression (4) and the expression (8) is referred to as a state space model.
[0058] Although, in the above-described example of E.sub.1 to E.sub.m, a case in which the grid points (L-dimensional) of the system model 21 and the grid points (L-dimensional) of the observation model coincide with each other is assumed, a case of noncoincidence is also conceivable practically. In such a case, the values of the respective elements of the matrices E.sub.1 to E.sub.m may be changed in such a way that each observation point at which a piece of observation data is actually obtained has, for example, a weighted average of values at neighboring grid points. As described above, the above-described E.sub.1 to E.sub.m express operations of relating the grid points of the state variables with degrees of resolution of observation points for a plurality of pieces of observation data in a manner of one-to-one, weighted average, weighted sum, or the like with respect to each piece of observation data.
[0059] As described above, each of the m observation models 31 related to one of the m sets of observation data selected by the data selection processing unit 30. Each of the observation model 31 transforms a state vector output from the system model 21 into a predetermined state vector on the basis of the expression (8), which expresses a relationship between a set of observation data and a state vector. Each of the observation model 31 outputs the transformed state vector to the posterior distribution creating unit 40. The transformed state vectors have prior distributions of m types of transformed state vectors X.sub.t at a time t.
[0060] On the basis of state vectors output from the m observation models 31 and sets of observation data selected by the data selection processing unit 30, the posterior distribution creating unit 40 creates posterior distributions of the state vector. The posterior distribution creating unit 40 categorizes, in the created posterior distributions, a posterior distribution based on all the m types of observation data, selected by the data selection processing unit 30, as a first posterior distribution. The posterior distribution creating unit 40 also categorizes a posterior distribution based on observation data that lack one or more types of observation data out of the m types of observation data, selected by the data selection processing unit 30, as a second posterior distribution. The posterior distribution creating unit 40 outputs the created first posterior distribution and second posterior distribution to the posterior distribution unifying unit 50 and the like. For example, the posterior distribution creating unit 40 may store the created first posterior distribution in the first posterior distribution storage unit 41a that is readable by the posterior distribution unifying unit 50 and the like. The posterior distribution creating unit 40 may also store the created second posterior distribution in the second posterior distribution storage unit 41b that is readable by the posterior distribution unifying unit 50 and the like. The posterior distribution creating unit 40 also outputs the created first posterior distribution to the system model 21 and the output unit 60. In this case, for example, the posterior distribution creating unit 40 may store the created first posterior distribution in the unified posterior distribution storage unit 52 that is readable by the system model 21 and the output unit 60.
[0061] Here, the creation processing of posterior distributions by the posterior distribution creating unit 40 will be described in detail. To the posterior distribution creating unit 40, prior distributions of m types of transformed X.sub.t at a time t and the sets of observation data OBS.sub.1 to OBS.sub.m are input. In general, a posterior distribution p(x|y) when a prior distribution p(x) and a distribution p(y) of observation data are input is, according to Bayes' theorem, expressed by the expression:
In the expression (9), p(y|x) in the numerator is referred to as a likelihood, which is an indicator of the goodness of fit of a state variable x to an observation value y. In the case in which an observation model 31 can be separated into a mapping h and a noise amount w, as expressed by the expression (8), for the likelihood p(y|x), a quantity calculated by the expression:
can be used. In the expression (10), r is the density function of the noise amount w. In the expression (10), the right side is redefined as a function LH of y and h(x). Further, a likelihood p(y.sub.1:m|x) in the case of m types of observation values y={y.sub.1, y.sub.2, . . . , y.sub.m} being obtained is, using a multiplication theorem recursively, expressed in a product form as:
In the expression (11), the first term p(y.sub.1|y.sub.1:0, x) is the probability of y.sub.1 when there is no observation data, that is, the likelihood p(y.sub.1|x) of x when y.sub.1 is obtained. The second term p(y.sub.2|y.sub.1:1, x) is the probability of y.sub.2 when y.sub.1 is obtained. However, the respective observation data are collected using separate sensors or the like, and no joint distribution of y.sub.1 and y.sub.2 exists. Thus, the second term, as a result, becomes the likelihood p(y.sub.2|x) of x when y.sub.2 is obtained. Therefore, the posterior distribution expressed by the expression (9) in this case is expressed by the expression:
In the expression (12), it is assumed that Z in the denominator is a normalization constant. If this relation is used, because of m types of observation data y having been obtained as OBS.sub.1 to OBS.sub.m, the posterior distribution of the state variable U.sub.k at a grid point k is, assuming the prior distribution of U.sub.k being denoted by p(U.sub.k), expressed by the expression:
The numerator is, as expressed by the expression (12), the product of the product of the likelihoods based on the respective sets of observation data and the prior distribution p(U.sub.k). Further, since each likelihood is expressed by the expression (10), the posterior distribution of the expression (13) is expressed by the expression:
As described above, the posterior distribution creating unit 40 calculates the posterior distribution of the state variable U.sub.k at a grid point k on the basis of m types of likelihoods LH, which are calculated on the basis of m sets of observation data OBS.sub.1 to OBS.sub.m and the mappings h.sub.1 to h.sub.m, and the prior distribution p(U.sub.k). In a similar manner, the posterior distribution creating unit 40 calculates posterior distributions with respect to all the grid points 1 to L using the expression (13), that is, the expression (14).
[0062] However, the posterior distribution creating unit 40 uses the expression (15) below in place of the expression (13) with respect to a grid point at which one or more types of observation data in the m types of observation data are missing. For example, as OBS.sub.2 illustrated in
and the number of likelihoods included in the numerator decreases to m−1. In the expression (15) and the expression (16), the expression “m−1” indicates that at least one type of observation data in the m types of observation data have not been obtained and does not limit the number of types of observation data that have not been obtained (are missing) to one.
[0063] As described above, the posterior distribution creating unit 40 creates a posterior distribution for each of the state variables at each of the grid points. Hereinafter, the posterior distribution for each of the state variables at each of the grid points is also referred to as a posterior distribution with respect to each combination of a state variable and a grid point. The posterior distribution creating unit 40 outputs a posterior distribution calculated on the basis of all the observation data using the expression (13) as a first posterior distribution. The posterior distribution creating unit 40 also outputs a posterior distribution calculated on the basis of observation data that lack at least one type of observation data in the m types of observation data using the expression (15) as a second posterior distribution.
[0064] It is now assumed that a prior distribution p(x) follows a normal distribution with a mean μ0 and a variance V.sub.prio, and n observation values y.sub.1, Y.sub.2, . . . , y.sub.n also follow a normal distribution with a mean μ and a variance V. In this case, the posterior distribution p(x|y), which is calculated according to Bayes' theorem expressed by the expression (9), also becomes a normal distribution, and the variance V.sub.post thereof is expressed by the expression:
This indicates that, as the number of observation values used for calculation of the posterior distribution increases, the variance decreases, that is, the accuracy of the posterior distribution improves.
[0065] While a first and a second posterior distribution are not always normal distributions individually, smaller pieces of observation data are taken into a second posterior distribution than those into a first posterior distribution. Thus, the variance of the first posterior distribution P(U.sub.k|OBS.sub.1:m) of the expression (13) and the variance of the second posterior distribution p(U.sub.k′|OBS.sub.1:m−1) of the expression (15) are respectively denoted by Var(p(U.sub.k|OBS.sub.1:m)) and Var(p(U.sub.k′|OBS.sub.1:m−1)). Then, an inequality expressed by the expression (18) below holds:
Var(p(U.sub.kOBS.sub.1:m))≦Var(p(U.sub.k′|OBS.sub.1:m−1)) (18).
[0066] Next, the posterior distribution unifying unit 50 will be described. The posterior distribution unifying unit 50 unifies a first posterior distribution and a second posterior distribution. More specifically, the posterior distribution unifying unit 50 calculates a new posterior distribution for each combination of a state variable and a grid point for which the second posterior distribution has been calculated by unifying the first posterior distribution and the second posterior distribution into the new posterior distribution. The posterior distribution unifying unit 50 outputs the new posterior distribution after unification to the determining unit 51. Since, in the present example embodiment, a posterior distribution is approximated by a set of ensembles, the posterior distribution unifying unit 50 may perform the unification by means of superposing ensembles approximating a first posterior distribution and ensembles approximating a second posterior distribution at a predetermined ratio.
[0067] Specifically, the posterior distribution unifying unit 50 obtains the afore-described first posterior distributions and second posterior distributions as input from the first posterior distribution storage unit 41a and the second posterior distribution storage unit 41b. Because of the relation expressed by the expression (18), p(U.sub.k′|OBS.sub.1:m−1), which is one of the second posterior distributions, has a larger variance (that is, lower accuracy) than does at least p(U.sub.k|OBS.sub.1:m), which is one of the first posterior distributions. Thus, the posterior distribution unifying unit 50 calculates a new posterior distribution for each combination of a state variable and a grid point for which the second posterior distribution has been calculated by unifying the first posterior distribution and another second posterior distribution into a new post-posterior distribution. For example, it is assumed that, with respect to a grid point j, a second posterior distribution p(U.sub.j|OBS.sub.1:m−1) has been calculated. In this case, with respect to the grid point j, the posterior distribution unifying unit 50, assuming that g is a function, calculates a new posterior distribution p′(U.sub.j|OBS.sub.1:m) by the expression (19) below:
p′(U.sub.j|OBS.sub.1:m)=g(p(U.sub.k|OBS.sub.1:m), p(U.sub.i|OBS.sub.1:m−1), π) (19).
Here, π denotes a parameter set that determines the function g. In addition, k denotes a grid point at which the first posterior distribution has been created. Further, i denotes another grid point at which the second posterior distribution has been created. In the expression, i≠j holds. Hereinafter, the dash (′) of the probability distribution p′ in the expression (19) indicates that the probability distribution p′ is a probability distribution after unification performed by the posterior distribution unifying unit 50. The posterior distribution unifying unit 50 outputs the posterior distribution p′(U.sub.j|OBS.sub.1:m) newly calculated in such a way and the original second posterior distribution p(U.sub.j|OBS.sub.1:m−1) to the determining unit 51.
[0068] Next, the determining unit 51 will be described. The determining unit 51 determines which one of a second posterior distribution or a unified posterior distribution is to be used. More specifically, for each combination of a state variable and a grid point for which a second posterior distribution has been created, the determining unit 51 determines which one of the original second posterior distribution and the unified posterior distribution is to be used as a posterior distribution. Specifically, the determining unit 51 may store the determined posterior distribution in the unified posterior distribution storage unit 52. In the unified posterior distribution storage unit 52, as described above, a first posterior distribution has been stored. The storing operation causes the first posterior distribution or the determined posterior distribution to be stored in the unified posterior distribution storage unit 52 for each combination of a state variable and a grid point.
[0069] For example, the determining unit 51 may determine, on the basis of the respective variance values of a second posterior distribution and a unified posterior distribution, which one is to be used. Specifically, to the determining unit 51, the unified posterior distribution p′(U.sub.j|OBS.sub.1:m), which is newly calculated by the posterior distribution unifying unit 50, and the original second posterior distribution p(U.sub.j|OBS.sub.1:m−1) are input. Both of these posterior distributions are posterior distributions at a grid point j. For example, the determining unit 51 may, as with the expression (18), calculate and compare the variances of these posterior distributions. In this case, if the variance of the unified posterior distribution p′(U.sub.j|OBS.sub.1:m) is smaller, the determining unit 51 selects and outputs the unified posterior distribution. The determining unit 51 also stores the selected posterior distribution in the unified posterior distribution storage unit 52.
[0070] On the other hand, if the variance of the unified posterior distribution p′(U.sub.j|OBS.sub.1:m) is larger, the determining unit 51 may repeat the calculation by varying the parameter π of the function g in the expression (19) until the variance of the unified posterior distribution becomes smaller than the variance of the original second posterior distribution. For example, in the case in which the function g is a weighted average function, the determining unit 51 may vary weighting factors thereof. The determining unit 51 may assume a prior distribution p(π.sub.prio) for the parameter π, and calculate a posterior distribution p(π.sub.post) of the parameter π that minimizes the variance thereof by using Bayes' theorem, expressed by the expression (9), with variance values of the expression (4) treated as observation values. When varying the parameter π results in the variance of the unified posterior distribution becoming smaller than the variance of the original second posterior distribution, the determining unit 51 selects and stores the unified posterior distribution in the unified posterior distribution storage unit 52. In the case in which varying the parameter π does not cause the variance to be smaller, the determining unit 51 selects and stores the original second posterior distribution in the unified posterior distribution storage unit 52.
[0071] In this way, in the unified posterior distribution storage unit 52, the whole set of posterior distributions of the state variable U.sub.k at a time t at all the grid points k (k=1 to L) is completed with the first posterior distribution and the unified posterior distribution or the second posterior distribution, which has been selected by the determining unit 51.
[0072] Next, the output unit 60 will be described. In the case of continuing the simulation, the output unit 60 inputs the state vector at a time t, which generated from a posterior distribution selected by the determining unit 51 and a first posterior distribution, to the system model 21. The system model 21, using the posterior distributions at the time t, calculates prior distributions at a time t+1, which is the next time step. The output unit 60 outputs, as a result from the simulation, a time series of the state vector, which is generated from the posterior distribution selected by the determining unit 51 and the first posterior distribution, to the output device 1006 and the like.
[0073] As described above, in the unified posterior distribution storage unit 52, the whole set of posterior distributions of the state variable U.sub.k at a time t at all the grid points k (k=1 to L) is completed with the first posterior distribution and the unified posterior distribution or the second posterior distribution, which has been selected by the determining unit 51. The output unit 60 may input posterior distributions for the respective combinations of a state variable and a grid point, which are stored in the unified posterior distribution storage unit 52, to the system model 21 and output a time series thereof.
[0074] Although, as described thus far, the configurations of the respective functional blocks are described using the state variables U.sub.k (k=1 to L) as an example, the respective functional blocks are configured in the same manner with respect to other state variables (for example, V.sub.k (k=1 to L)).
[0075] An operation of the simulation device 100 configured as described above will be described with reference to the drawings.
[0076] First, an operation that the simulation device 100 performs at the start of a simulation will be described using
[0077] In
[0078] Next, the input unit 10 obtains first to M-th sets of observation data (step S102).
[0079] Next, referring to the information on the state variables set by the system model 21, the data selection processing unit 30 selects m types of observation data to be used from the first to M-th sets of observation data (step S103).
[0080] Next, the data selection processing unit 30 sets a relational expression relating the state variables and the m types of observation data with each other and noise amounts included therein, and creates the observation models 31-1 to 31-m (step S104). For example, the data selection processing unit 30 may set the relational expression and noise amounts on the basis of types, properties, and physical quantities of the sets of observation data, the numbers of dimensions of the sets of observation data and state variables, and the like. This causes the m observation models 31 to be created.
[0081] With this processing, the simulation device 100 completes the operation performed at the start of a simulation.
[0082] Next, an operation by which the simulation device 100 performs a simulation will be described using
[0083] In
[0084] Next, the system model 21 calculates ensembles at a next time step, that is, prior distributions, and stores the calculated prior distributions in the prior distribution storage unit 22 (step S202).
[0085] The input unit 10 now determines whether or not at least any of the first to m-th sets of observation data is obtained at the time of this time step (step S203).
[0086] In the case in which no observation data is obtained (No in step S203), the system model 21, using the prior distributions at the next time step, which is stored in the prior distribution storage unit 22, performs step S202 again and performs calculation of advancing one more time step.
[0087] It is assumed that it is also determined No in step S203 in the case of being specified not to revise data at this time step even when any set of observation data is obtained.
[0088] On the other hand, in the case in which at least any of the first to m-th sets of observation data is obtained and data are to be revised (Yes in step S203), each of the observation models 31-1 to 31-m transforms prior distributions stored in the prior distribution storage unit 22 (step S204).
[0089] At this time, in an identical simulation, the observation models created in step S104 at the start of the simulation are basically used as the observation models 31-1 to 31-m. However, even in an identical simulation, in an exceptional case, such as a case in which the behavior of observation data substantially changes and a case in which simulation calculation does not work well, the afore-described step S104 may be performed again. In this case, in this step S204, transformation may be performed using newly-created m observation models 31.
[0090] Next, the posterior distribution creating unit 40 creates a posterior distribution for each combination of a state variable and a grid point on the basis of the created m types of transformed prior distributions and the m types of observation data at the time of this time step (step S205). This operation causes the original prior distributions to be revised.
[0091] Next, the posterior distribution creating unit 40 determines whether the posterior distribution, created in step S205, for each combination of a state variable and a grid point is a posterior distribution based on all the m types of observation data selected in step S103 or a posterior distribution based on observation data that lack a portion of the m types of observation data (step S206).
[0092] When the posterior distribution is determined to be a posterior distribution based on all the m types of observation data, the posterior distribution creating unit 40 stores the posterior distribution, as a first posterior distribution, in the first posterior distribution storage unit 41a (step S207).
[0093] In this case, the posterior distribution creating unit 40 also stores the first posterior distribution, as a posterior distribution for the combination of a state variable and a grid point, in the unified posterior distribution storage unit 52 (step S208).
[0094] On the other hand, in step S206, when the posterior distribution is determined to be a posterior distribution based on observation data that lack a portion of the m types of observation data, the posterior distribution creating unit 40 categorizes the posterior distribution as a second posterior distribution and calculates a variance value V0 thereof (step S209). The posterior distribution creating unit 40 stores the second posterior distribution and the variance value V0 thereof in the second posterior distribution storage unit 41b.
[0095] Next, the posterior distribution unifying unit 50 calculates a new posterior distribution (unified posterior distribution) whose variance value V is minimum by, for each combination of a state variable and a grid point for which the second posterior distribution is created, unifying the first posterior distribution and the second posterior distribution (step S300).
[0096] Specifically, the posterior distribution unifying unit 50 may, for each target combination of a state variable and a grid point, calculate a unified posterior distribution repeatedly using the expression (19) and search for π that minimizes the variance value V while varying a parameter set π. For example, the posterior distribution unifying unit 50 may perform the search using the least squares method or Bayes' theorem. A minimum value of the variance, obtained from the search, is denoted by Vmin.
[0097] Next, the determining unit 51 compares the minimum value Vmin of the variance with the variance value V0 before unification (step S301).
[0098] If the minimum value Vmin of the variance after unification is smaller, the determining unit 51 sets the unified posterior distribution as a new posterior distribution for the combination of a state variable and a grid point (step S302), and stores the unified posterior distribution in the unified posterior distribution storage unit 52 (step S208).
[0099] On the other hand, if the minimum value Vmin of the variance after unification does not become smaller, the determining unit 51 discontinues the unification, sets the second posterior distribution as a posterior distribution for the combination of a state variable and a grid point (step S303), and stores the second posterior distribution in the unified posterior distribution storage unit 52 (step S208).
[0100] Next, if the simulation does not reach a predefined time or a predefined step (No in step S304), the simulation device 100 repeats the operations after step S202. That is, the system model 21 performs step S202 using, as input, posterior distributions for the respective combinations of a state variable and a grid point, stored in the unified posterior distribution storage unit 52, and starts calculation of the next step.
[0101] On the other hand, when the simulation reaches the predefined time or the predefined step (Yes in step S304), the output unit 60 outputs a time series of posterior distributions for the respective combinations of a state variable and a grid point, stored in the unified posterior distribution storage unit 52, and finishes the simulation operation.
[0102] Next, an advantageous effect of the first example embodiment of the present invention will be described.
[0103] The simulation device as the first example embodiment of the present invention may perform a high-resolution and high-accuracy simulation over a wide range taking into consideration non-ideal observation data and observation data that have a discontinuity or peculiarity.
[0104] The reasons for the above advantageous effect will be described. In the present example embodiment, the system model simulates time evolutions of the state vector. The data selection processing unit selects m types of observation data from M types of observation data. The m observation models each of which corresponds to one of the m types of observation data transform prior distributions of the state vector at a next step, which is calculated by the system model, on the basis of relationships between the m types of observation data and the state vector. Based on the transformed m types of prior distributions and the selected m types of observation data, the posterior distribution creating unit creates a posterior distribution for each combination of a state variable and a grid point. The posterior distribution creating unit categorizes, in the created posterior distributions, a posterior distribution based on all the m types of observation data as a first posterior distribution and a posterior distribution based on observation data that lack a portion of the m types of observation data as a second posterior distribution. The posterior distribution unifying unit, for each combination of a state variable and a grid point for which the second posterior distribution is created, unifies the first posterior distribution and the second posterior distribution, and creates a new posterior distribution. The determining unit determines which one of the second posterior distribution or the new posterior distribution is to be selected, and sets the determined posterior distribution as a posterior distribution after unification for the combination of a state variable and a grid point. The system model calculates the state vector at the next step using, as input, posterior distributions of the state vector, which are generated from the first posterior distribution and the unified posterior distribution.
[0105] Because of the above-described reasons, even in the case in which a portion of the observation data are inappropriate or include a lot of errors, the present example embodiment may, by unification of posterior distributions, perform revision taking into consideration other observation data. Alternatively, since such observation data come not to be used for revision, the present example embodiment may prevent an error from increasing. Since, even for observation data that have a low measurement frequency, the present example embodiment may perform revision by taking into consideration observation data that have a high measurement frequency, the present example embodiment enables simulation with higher accuracy.
[0106] The advantageous effect as described above will be described using
[0107] Comparison between
[0108] As described above, even when observation data that, when used alone, include only an insufficient number of pieces of data or have a distribution that is biased spatially and temporally are provided, the present example embodiment may, by using a variety of types of such observation data, enable a high-resolution and high-accuracy simulation to be performed over a wider range. In the future, due to progress in observation technologies and information collection from a large number of sensors, as in, for example, M2M (Machine-to-Machine), a larger variety of and a large quantity of observation data are expected to be collected. In such a situation in which a larger variety of and a larger quantity of data are collected, the present example embodiment may, by using information from a plurality of sets of observation data in a unifying manner, perform a more effective simulation in comparison with the related technology in which the accuracy of simulation is constrained by characteristics of observation data.
Second Example Embodiment
[0109] Next, a second example embodiment of the present invention will be described with reference to the drawings. The present example embodiment is applicable to simulation using observation data that are spatially discrete but the values of which are of high accuracy and observation data that are spatially continuous but the values of which are of insufficient accuracy. In the following description, a specific example in which, using a simulation device in the present invention, simulation of soil moisture content is performed will be described. In the respective drawings referenced in the second example embodiment of the present invention, the same signs are assigned to the same components and steps as those in the first example embodiment of the present invention and a detailed description thereof in the present example embodiment will be omitted.
[0110] First, a configuration of a simulation device 200 as the second example embodiment of the present invention is illustrated in
[0111] In the present example embodiment, a soil initial state is applied as an initial state in the present invention, and terrain/weather parameters are applied as parameters in the present invention. As two (M=2) sets of observation data, two types of observation data, soil moisture data OBS.sub.1 and satellite data OBS.sub.2, are applied.
[0112] Here, the two types of observation data to be used in the present example embodiment, the soil moisture data OBS.sub.1 and the satellite data OBS.sub.2, will be described. In
[0113] The soil moisture data OBS.sub.1 may be, for example, observation values obtained from dielectric constant soil moisture sensors, which are buried under soil and calculate soil moisture values on the basis of dielectric constants. In addition, the soil moisture data OBS.sub.1 may be observation values collected by other types of sensors. A feature of the soil moisture data OBS.sub.1 is that, although observation values are discrete in space because only values at points where sensors are placed can be measured, the observation values are of high accuracy because physical quantities equivalent to soil moisture are directly measured. In
[0114] The satellite data OBS.sub.2 may be, for example, remote sensing data obtained from the ASTER sensor mounted on the Terra satellite (Terra/ASTER). More specifically, data, collected by the Terra/ASTER, representing the intensity of reflected light from sunlight in the near-infrared (Band 3, 0.78-0.86 μm) and short-wavelength infrared (Band 4, 1.600-1.700 μm) wavelengths are applicable as the satellite data OBS.sub.2. In addition, as the satellite data OBS.sub.2, data collected by other methods or in other wavelengths may be applicable. A feature of the satellite data OBS.sub.2 is that, since the intensity of reflected light off the surface of the ground from sunlight in the near-infrared and short-wavelength infrared wavelength ranges can be collected as two-dimensional image data, observation values are continuous in space. However, since the satellite data OBS.sub.2 are estimated on the basis of obtained data using a statistically significant correlation between the intensity of reflected light, reflectivity, or the like in the above wavelengths and moisture content of ground surface layer soil, observation values are indirect values and, thus, there is a possibility that the accuracy thereof becomes insufficient.
[0115] Next, the soil model 221 will be described. The soil model 221 is an example of the system model 21 in the first example embodiment of the present invention. The soil model 221 calculates the space and time variation of soil moisture content and the like using, as parameters, physical properties of soil to be observed, such as degrees of slope and drainage, and weather conditions, such as precipitation. To the soil model 221, for example, an LSM (LAND-SURFACE MODEL) may be applied. To the soil model 221, a soil module of a decision support system for agriculture DSSAT (Decision Support System for Agrotechnology Transfer) and the like may also be applied.
[0116] The posterior distribution unifying unit 250, as with the first example embodiment of the present invention, calculates a new posterior distribution by unifying a first posterior distribution and a second posterior distribution into the new posterior distribution for each combination of a state variable and a grid point for which the second posterior distribution is created. In addition, in performing the unifying processing, the posterior distribution unifying unit 250 may use a model that is created on the basis of spatial correlations between respective posterior distributions having been already calculated. As the model, for example, a covariance function and a variogram function are applicable. However, the posterior distribution unifying unit in the present invention may use another model based on spatial correlations between respective posterior distributions. In this case, the posterior distribution unifying unit 250 may apply Bayesian updating to parameters, which characterize arithmetic operations in the model used for unification, on the basis of spatial correlations between respective posterior distributions having been already calculated. The processing using a model based on spatial correlations and the processing of applying Bayesian updating to the parameter thereof will be described in the following description of an operation in conjunction with a specific example.
[0117] A specific example of an operation of the simulation device 200 configured as described above will be described.
[0118] First, the soil model 221 obtains a soil initial state and terrain/weather parameters via the input unit 10 and sets soil moisture content SM.sub.k at a grid point k to a state variable (step S101 in
[0119] When it is assumed that state variables at a grid point k (k=1 to 9) illustrated in
X.sub.t =(SM.sub.1, SM.sub.2, . . . , SM.sub.9),.sub.t.sup.T (20).
Here, a description will be made mainly on an example in which only soil moisture content is set as a state variable at a grid point. However, in addition to a dynamic variable varying in time and a quantity the value of which is to be estimated, a static variable is applicable as a state variable. The state variables may be chosen depending on a phenomenon subject to simulation, a system model, a purpose, and the like. The state variables may be chosen so that, as expressed by the expression (2), a state vector at a time can be created on the basis of a state vector at the previous step and the soil model 221. Since, as the number of state variables increases, calculation amount increases, the state variables are preferably set appropriately in accordance with allowable computational resources.
[0120] Next, a data selection processing unit 30 obtains two types of observation data (step S102 in
[0121] Next, the data selection processing unit 30 create two observation models including a first observation model 231-1 related to the soil moisture data OBS.sub.1 and a second observation model 231-2 related to the satellite data OBS.sub.2 (step S104).
[0122] A case in which the soil moisture data OBS.sub.1 have the same number of dimensions as that of the state variables SM and noises in observation values follow Gaussian (normal) distributions is assumed here. It is also assumed that, as illustrated in
OBS.sub.1=X+w.sub.1 (21).
Here, the observation noise w.sub.1 may be set to be, for example, a Gaussian distribution with a mean of 0 and a variance σ1. In this way, the data selection processing unit 30 creates the first observation model 231-1 expressed by the expression (21).
[0123] It is assumed that, with regard to the satellite data OBS.sub.2, the intensity of reflected light or reflectivity observed in the near-infrared and short-wavelength infrared wavelengths and soil moisture content are related with each other by means of a non-linear function h.sub.2. It is, however, assumed that observation grid points coincide with the calculation grid points, as with the soil moisture data OBS.sub.1. In this case, the observation data OBS.sub.2 and the state variables are, according to the observation model equation expressed by the expression (8), expressed by a non-linear relational expression expressed by the expression:
OBS.sub.2=h.sub.2(X, w.sub.2) (22).
Here, the observation noise w.sub.2 may also be set to be, for example, a Gaussian distribution with a mean of 0 and a variance σ2. In this way, the data selection processing unit 30 creates the second observation model 231-2 expressed by the expression (22).
[0124] Next, the soil model 221, at the start point of a simulation, obtains ensembles based on the soil initial state (t=0 in the expression (20)), the terrain/weather parameters, and ensembles representing a system noise. The soil model 221 calculates prior distributions of the state vector at t=1 using the time evolution equation of each ensemble, expressed by the expression (4), and stores the calculated prior distributions in a prior distribution storage unit 22 (step S202 in
[0125] Next, it is assumed that, at the time t=1, the observation data OBS.sub.1 and OBS.sub.2 is obtained (Yes in step S203). Thus, the observation models 231-1 and 231-2 transform the ensembles of the state vector at the time t=1, stored in the prior distribution storage unit 22, using the expressions (21) and (22) (step S204).
[0126] Next, the posterior distribution creating unit 40, for each grid point, calculates a posterior distribution using Bayes' theorem expressed by the expression (9) (step S205). However, as illustrated in
Here, the expression (23) holds true for i=1, 3, and 8. It is assumed that OBS1i and OBS2i denote pieces of the observation data OBS.sub.1 and OBS.sub.2 obtained at the grid point i, respectively. The first posterior distributions at the grid points 1, 3, and 8, which are calculated using the expression (23), are stored in a first posterior distribution storage unit 41a (Y in step S206 and step S207). The first posterior distributions at the grid points 1, 3, and 8 are also stored in a unified posterior distribution storage unit 52 (step S208).
[0127] Since, at each of the latter grid points 2, 4, 5, 6, 7, and 9, one of the types of observation data selected by the data selection processing unit 30 is missing, the posterior distribution creating unit 40 calculates a second posterior distribution using the expression (24) below, which is based on the expression (15):
Here, the expression (24) holds true for j=2, 4, 5, 6, 7, and 9. The second posterior distributions at the grid points 2, 4, 5, 6, 7, and 9, which have been calculated using the expression (24), are stored in a second posterior distribution storage unit 41b (N in step S206 and step S209).
[0128] Next, the posterior distribution unifying unit 250 unifies the first and second posterior distributions, which are calculated using the expressions (23) and (24) (step S300).
[0129] In the present example embodiment, as a function g for unifying posterior distributions, which is expressed by the expression (19), for example, a linear combination of posterior distributions at surrounding grid points is considered. For example, with regard to the grid point 2 illustrated in
p′(SM.sub.2|OBS.sub.1,OBS.sub.2)=α.sub.1p(SM.sub.1|OBS.sub.1,OBS.sub.2)+α.sub.3p(SM.sub.3|OBS.sub.1,OBS.sub.2)+. . . +α.sub.9p(SM.sub.9|OBS.sub.2) (25).
Here, α1 to α9 are weighting factors that are equivalent to a parameter set grin the expression (19). Hereinafter, the dash (′) of the probability distribution p′ in the expression (25) indicates that the probability distribution is a probability distribution after unification by the posterior distribution unifying unit 250. Then, the expression (25) may be considered equivalent to the so-called Kriging method, in which an unknown value at the grid point 2 is determined on the basis of a probabilistic interrelation with values at surrounding grid points, that is, a spatial correlation. The values at the grid points are, however, not definite values but posterior distributions calculated using the expressions (23) and (24). That is, when a covariance function expressing a spatial correlation between posterior distributions p(SM|OBS) of soil moisture content at a position r.sub.k of a grid point k and a position r.sub.k+γ of a grid point separated from the grid point k by a distance γ.
C(γ)=C{p(SM(r.sub.k)|OBS), p(SM(r.sub.k+γ)|OBS)} (26)
is obtained, the weighting factors α1 to α9 in the expression (25), that is, the parameter set π, is also obtained. In the expression (26), SM(x) denotes a state variable SM at a grid point located at a position x. In addition, OBS denotes m types of observation data. The parameter set can be obtained by solving a simple Kriging equation system as expressed by, for example, the expression (27) below. In the present invention, the method for obtaining the parameters π in the function g, which the posterior distribution unifying unit uses in unifying posterior distributions, is not limited to the above-described method, and may be another method.
[0130] Next, an operation of obtaining a covariance function expressed by the expression (26) will be described. Since, between a covariance function C(γ) and a variogram function V(γ), a simple relation:
V(γ)=C(0)−C(γ) (28)
holds, it may be good to obtain either of the functions. In the following description, a case of, in the posterior distribution unifying unit 250, obtaining a variogram function V(γ)first will be described. A variogram, as with a covariance function, represents a probabilistic interaction, that is, a spatial correlation between a position r.sub.k of a grid point k and a position r.sub.k+γ of a grid point separated from the grid point k by a distance γ. On the left side of
V(γξ)=τ.sup.2+σ.sup.2(1−exp(−φ∥γ∥.sup.2)) ξ=(τ.sub.2, σ.sup.2, φ) (29)
is fit, and parameters ξ thereof are estimated. In the expression (29), ξ denotes a set of three types of parameters characterizing a variogram, which are generally referred to as a nugget τ.sup.2, a range φ, and a sill σ.sup.2. In the example, a result from an estimation performed, according to Bayes' theorem expressed by the expression (9), with respect to a range φ and a nugget τ.sup.2, among the parameters, is illustrated. Specifically, assuming a uniform prior distribution for the range φ and an exponential prior distribution for the nugget τ.sup.2 because of values close to 0 being expected for the nugget τ.sup.2, posterior distributions of the respective parameters were obtained on the basis of actually calculated variograms according to Bayes' theorem. Examples of obtained results are illustrated on the right side of
[0131] Therefore, since the covariance function C(γ) expressed by the expression (26) is calculated, the simple Kriging equation system expressed by the expression (27), for example, can be solved. Since, by that, the coefficients in the expression (25) expressing unification of posterior distributions, that is, the parameter set π, is calculated, the posterior distribution unifying unit 250 is able to obtain the unified posterior distribution p′(SM.sub.2|OBS.sub.1, OBS.sub.2) at the grid point 2. The posterior distribution unifying unit 250 also obtains a unified posterior distribution p′(SM.sub.k|OBS.sub.1, OBS.sub.2) with respect to another grid point k at which a second posterior distribution is created in the same manner.
[0132] Here, details of the unification operation performed by the posterior distribution unifying unit 250 in step S300 are illustrated in
[0133] In
[0134] Next, the posterior distribution unifying unit 250 defines a function that may fit to the variograms or covariances calculated in step S401 (step S402).
[0135] Next, the posterior distribution unifying unit 250 assumes prior distributions for parameters of the function defined in step S402 (step S403).
[0136] Next, the posterior distribution unifying unit 250 obtains posterior distributions of the parameters by updating the prior distributions, assumed in step S403, of the parameters on the basis of the calculated variograms or covariances by use of Bayes' theorem (step S404).
[0137] Next, the posterior distribution unifying unit 250 derives a covariance function using the posterior distributions of the parameters obtained in step S404 (step S405).
[0138] Next, the posterior distribution unifying unit 250, using a Kriging equation, obtains weighting factors (parameter set π) used in unifying posterior distributions at grid points other than the target grid point (step S406).
[0139] Next, the posterior distribution unifying unit 250, using the parameter set π obtained in step S406, unifies posterior distributions at grid points other than the target grid point (step S407).
[0140] In this way, in step S300 in
[0141] Subsequently, the simulation device 200 performs steps S301 to S304 and S208 in the same manner as in the first example embodiment of the present invention. By this, with respect to each grid point at which a second posterior distribution is created, a unified posterior distribution or the second posterior distribution is stored in the unified posterior distribution storage unit 52. The soil model 221, using the state vector generated from posterior distributions at the time step, which are stored in the unified posterior distribution storage unit 52, continues calculation for the next time step.
[0142] Next, an advantageous effect of the second example embodiment of the present invention will be described.
[0143] The simulation device as the second example embodiment of the present invention may perform a high-resolution and high-accuracy simulation over a wide range taking into consideration non-ideal observation data and observation data that have a discontinuity or peculiarity.
[0144] The reasons for the above advantageous effect will be described. That is because the present example embodiment includes the following configuration in addition to the same configuration as that of the first example embodiment of the present invention. That is, that is because the posterior distribution unifying unit, in unifying a first posterior distribution and a second posterior distribution with respect to each grid point at which the second posterior distribution is created, uses a model that is created on the basis of spatial correlations between calculated posterior distributions. That is also because the posterior distribution unifying unit applies Bayesian updating to parameters characterizing arithmetic operations in the model used in the unification on the basis of spatial correlations between calculated posterior distributions.
[0145] Accordingly, even when observation data that, when used alone, include only an insufficient number of pieces of data or have a distribution that is biased spatially are provided, by using such observation data in plural varieties, the present example embodiment may unify posterior distributions at grid points with higher accuracy. As a result, the present example embodiment enables a high-resolution and high-accuracy simulation to be performed over a wider range.
[0146] In the second example embodiment of the present invention, an example in which a soil model is applied as the system model, soil sensor data and satellite data are applied as a plurality of sets of observation data, and simulation of soil moisture content is performed is described. In addition thereto, the present example embodiment may be embodied for another object using another system model and observation data. For example, the present example embodiment may be embodied by applying a weather model as the system model and weather sensor data and satellite data as a plurality of sets of observation data.
Third Example Embodiment
[0147] Next, a third example embodiment of the present invention will be described with reference to the drawings. The present example embodiment may be applied to simulation in the case in which observation grid intervals of a plurality of sets of observation data differ from one another and simulation in the case in which collection time intervals thereof differ from one another. In the following description, a specific example in which simulation of crop growth is performed using a simulation device of the present invention will be described. In the drawings referenced in the third example embodiment of the present invention, the same signs are assigned to the same components and steps as those in the first example embodiment of the present invention and a detailed description thereof in the present example embodiment will be omitted.
[0148] First, a configuration of a simulation device 300 as the third example embodiment of the present invention is illustrated in
[0149] In the present example embodiment, a soil initial state is applied as an initial state in the present invention, and terrain/weather/crop parameters are applied as parameters in the present invention. As two (M=2) sets of observation data, two types of satellite data (remote sensing data) are applied.
[0150] Two types of observation data to be used in the present example embodiment, satellite data OBS.sub.1 and satellite data OBS.sub.2, will now be described.
[0151] As illustrated in
[0152] The first satellite data, which are collected at a high frequency and have a low spatial resolution, may be data obtained from, for example, a MODIS sensor mounted on the Terra satellite or the AQUA satellite (Terra-AQUA/MODIS). More specifically, data, collected by the Terra-AQUA/MODIS, representing the intensity of reflected light from sunlight in the visible red band (wavelength of 0.58-0.86 μm) and near-infrared band (wavelength of 0.725-1.100 μm) are applicable as the first satellite data. The first satellite data as described above can, although depending on the latitude of a region where data are collected, be collected every day basically. However, the first satellite data as described above have a ground level spatial resolution of as low as approximately 250 m.
[0153] Observation data that are usable as the second satellite data, which are collected at a low frequency and have a high spatial resolution, include observation data obtained from, for example, a LANDSAT satellite, a PLEIADES satellite, the ASNARO satellite, or the like. The wavelength range of satellite data collected by the above-described satellites is approximately the same as the wavelength of data collected as the first satellite data. The collection frequency and ground level resolution of the second satellite data as described above are, in the case of a LANDSAT satellite, every 8 to 16 days and approximately 30 m, and, in the case of a PLEIADES satellite and the ASNARO satellite, every 2 to 3 days and approximately 2 m.
[0154] The Normalized Difference Vegetation Index (NDVI), which is generally used as a vegetation index that indicates the growth state of a crop, can be calculated from reflectivity values in the afore-described two bands (the visible red band and the near-infrared band). The wavelength range of data collected as observation data is, however, not necessarily limited to the above bands. In the present example embodiment, the crop model 321 calculates the Leaf Area Index (LAI) as a quantity representing the growth state of a crop. The LAI is known to have a correlation with the vegetation index NDVI. The LAI as described above is calculated upon inputting data of a soil initial state and terrain/weather/crop parameters set to the crop model 321.
[0155] One of the differences between the present example embodiment and the other afore-described example embodiments of the present invention is that the grid intervals of the two types of observation data OBS.sub.1 and OBS.sub.2 differ from each other. Thus, the calculation grids of the crop model 321 are set in such a way as to coincide with the grids of at least either one of the observation data OBS.sub.1 and OBS.sub.2. With regard to the other observation data, a vector in an observation model equation expressed by the expression (7) may be changed so as to have, for example, weighted averages of values at neighboring grid points as elements thereof. Another difference between the present example embodiment and the other afore-described example embodiments of the present invention is that the collection time intervals of the two types of observation data OBS.sub.1 and OBS.sub.2 differ from each other. Thus, the posterior distribution unifying unit 350, by estimating posterior distributions obtained from the observation data OBS.sub.2, which are collected at a low frequency, on the basis of temporal correlations, unifies the posterior distributions obtained from the observation data OBS.sub.2 with posterior distributions obtained from the observation data OBS.sub.1, which are collected at a high frequency, in synchronization with collection times of the posterior distributions obtained from the observation data OBS.sub.1.
[0156] Using an example of two types of observation data OBS.sub.1 and OBS.sub.2 illustrated in
[0157] First, the crop model 321 sets leaf area indices LAI.sub.k as state variables at grid points k (k=1 to 16) illustrated in
[0158] Next, a data selection processing unit 30 obtains the two types of observation data (step S102), and selects the first satellite data OBS.sub.1 and the second satellite data OBS.sub.2 as m types of observation data to be used (step S103).
[0159] Next, the data selection processing unit 30 creates two observation models, a first observation model 331-1 related to the first satellite data OBS.sub.1 and a second observation model 331-2 related to the second satellite data OBS.sub.2 (step S104).
[0160] Referring to
[0161] Since grid points at which the second observation data OBS.sub.2 are collected correspond to the calculation grid points in a one-to-one manner, the observation model 331-2 is, using an identity matrix, expressed by the expression:
Here, H.sub.1 and H.sub.2 are mappings that include a mapping h that associates their respective sets of observation data with the state variables LAI and matrices that associate sets of grid points with each other. In addition, w.sub.1 and w.sub.2 are observation noises and may be set to, for example, a Gaussian distribution with a mean of 0 and a variance σ and the like. The observation models 331-1 and 331-2 expressed by the expressions (30) and (31) are specific examples of the observation model equation expressed by the expression (8).
[0162] Next, the crop model 321 obtains a soil initial state and terrain/weather/crop parameters and calculates prior distributions of the state vector at the next step in the simulation (steps S201 and S202 in
[0163] Next, a posterior distribution creating unit 40 creates a posterior distribution of the state variable LAI.sub.k at each grid point k (k=1 to 16) at the time t−1 in
and stores the calculated first posterior distribution in a first posterior distribution storage unit 41a (step S205, Yes in step S206, and step S207). In the expression (32), LH is a function that calculates a likelihood expressed by the expression (13), and Z in the denominator is a normalization constant.
[0164] A case in which transformed prior distributions p(LAI.sub.k) at a time t in
and stores the calculated second posterior distribution in a second posterior distribution storage unit 41b (step S205, No in step S206, and step S209).
[0165] Next, the posterior distribution unifying unit 350 unifies the first posterior distribution expressed by the expression (32) and the second posterior distribution expressed by the expression (33). Specifically, in the present example embodiment, as a function g for unifying posterior distributions, which is expressed by the expression (19), a linear combination of a second posterior distribution at a present time and a posterior distribution estimated from a first posterior distribution at a different time on the basis of a temporal correlation is applied. For example, with regard to a posterior distribution at the time t in
p′(LAI.sub.k|OBS.sub.1, OBS.sub.2).sub.t=α.sub.0p(LAI.sub.k|OBS.sub.1).sub.t+β.sub.0p(LAI.sub.k|OBS.sub.1, OBS.sub.2).sub.1:t−1 (34).
[0166] In the expression (34), α.sub.o and β.sub.0 are weighting factors that are equivalent to the parameter set π in the expression (19). In the expression (34), the dash (′) of the probability distribution p′ indicates that the probability distribution is a probability distribution after unification by the posterior distribution unifying unit 350.
[0167] A specific example of the processing of estimating a posterior distribution p(LAI.sub.k|OBS.sub.1, OBS.sub.2).sub.1:t−1 at a time t from posterior distributions at times (t−1, t−2, t−3, . . . ) before the time t based on a temporal correlations will now be described. In general, as a method for estimating a value at a time t from values at times (t−1, t−2, t−3, . . . ) before the time t, a so-called autoregressive (AR) model:
p(LAI.sub.k|OBS.sub.1, OBS.sub.2).sub.1:t−1=f.sub.AR(p(LAI .sub.k|OBS.sub.1, OBS.sub.2).sub.t−1, p(LAI.sub.k|OBS.sub.1, OBS.sub.2).sub.t−2, . . . ) (35)
is applicable. Here, a case in which an AR model f.sub.AR is expressed in a linear form is considered as an example. It is assumed that a time at which both the first satellite data OBS.sub.1 and the second satellite data OBS.sub.2 have been observed and a first posterior distribution has been created and that is a time before a time t is denoted by t−i (i=1 and 3 in
[0168] A case in which a posterior distribution at a time at which no second observation data OBS.sub.2 is obtained and a second posterior distribution is created (in
In the expression (37), the time t−i indicates a time at which a first posterior distribution is calculated, and the time t−j indicates a time at which a second posterior distribution is calculated. In the case of
[0169] Using the posterior distribution p(LAI.sub.k|OBS.sub.1, OBS.sub.2).sub.1:t−1 at the time t estimated in this way, the posterior distribution unifying unit 350 performs unification using the expression (34) (step S300).
[0170] Subsequently, the simulation device 300 executes steps S301 to S304 and S208 as with the first example embodiment of the present invention. By this, the unified posterior distribution or the second posterior distribution is stored in the unified posterior distribution storage unit 52 with respect to each grid point at a time t at which the second posterior distribution is created at the grid point. Using the state vector, which is generated from posterior distributions at a time t stored in the unified posterior distribution storage unit 52, the crop model 321 continues calculation for the next time step.
[0171] When a predefined time is reached, the simulation device 300 finishes the operation.
[0172] Next, an advantageous effect of the third example embodiment of the present invention will be described.
[0173] The simulation device as the third example embodiment of the present invention may perform a high-resolution and high-accuracy simulation over a wide range taking into consideration non-ideal observation data and observation data that have a discontinuity or peculiarity.
[0174] The reasons for the above advantageous effect will be described. That is because the present example embodiment includes the following configuration in addition to the same configuration as that of the first example embodiment of the present invention. In other words, that is because the posterior distribution unifying unit, in unifying a posterior distribution with respect to each grid point at which a second posterior distribution is created, uses a model that is created on the basis of temporal correlations among posterior distributions having been already calculated in the past.
[0175] As described above, the present example embodiment estimates, from posterior distributions having been already calculated in the past prior to a time t at which second posterior distributions are created, posterior distributions at the time t on the basis of temporal correlations, and, using the estimated posterior distributions, calculates unified posterior distributions at the time t. With this processing, the present example embodiment enables a plurality of sets of observation data that are collected at different frequencies to be unified in synchronization with collection timings of observation data that are observed at a higher frequency. That is, in the present example embodiment, prior distributions at a time t (simulation results) are revised to more probable unified posterior distributions at a shorter interval. As a result, the present example embodiment may reduce errors in estimating values after a next time step.
[0176] Although, in the present example embodiment, an example in which a crop model is applied as the system model and satellite data are applied as all of a plurality of sets of observation data is described, the present example embodiment does not limit the system model and the types and contents of observation data. For example, in the present example embodiment, a water dynamics and fluid model may be applied as a system model, and water level sensor data and satellite data of a river may be applied as observation data. As described above, the present example embodiment may be applied to a combination of observation data that are collected at a high frequency but are locally discrete and observation data that are collected at a low frequency but have a high resolution and are widespread, using a system model corresponding thereto appropriately.
Fourth Example Embodiment
[0177] Next, a fourth example embodiment of the present invention will be described with reference to the drawings. In the present example embodiment, a specific example in which, using a simulation device of the present invention, simulation of precipitation is performed will be described. The fourth example embodiment of the present invention is an example embodiment in which the calculation grid space in the second example embodiment of the present invention is expanded into a three-dimensional space. In the drawings referenced in the fourth example embodiment of the present invention, the same signs are assigned to the same components and steps as those in the second example embodiment of the present invention and a detailed description thereof in the present example embodiment will be omitted.
[0178] First, a configuration of a simulation device 400 as the fourth example embodiment of the present invention is illustrated in
[0179] Here, two types of observation data, the GPS precipitable water data OBS.sub.1 and acoustic radar data OBS.sub.2, which are assumed to be used in the present example embodiment will be described. GPS precipitable water is data obtained by estimating a vertically integrated water vapor content in the atmosphere, on the basis of a characteristic that, as water vapor in the atmosphere on a path until a radio wave radiated from a GPS (Global Positioning System) satellite reaches a GPS receiver increases, arrival time is delayed longer. GPS precipitable water has contributed to an improvement in the accuracy of estimation of a timing at which local heavy rain occurs and estimation of a total amount of rainfall in a round of rainfall. GPS precipitable water has a characteristic that, since, on the ground side, it is only required to arrange GPS receivers, densification is relatively easily achieved in the land surface. In contrast, with respect to the vertical direction, since GPS precipitable water is only an integrated amount in the vertical direction, it is difficult to express spatial distribution properly by means of GPS precipitable water. On the other hand, using acoustic radar enables the altitudinal dependency of water vapor content to be measured. For example, when a sound wave is emitted upward in the vertical direction and a scattering echo due to turbulence in the atmosphere is received, the echo depends on the altitudinal gradient of atmospheric refractivity. Moreover, the altitudinal gradient of atmospheric refractivity depends strongly on the altitudinal gradient of water vapor content. Therefore, observing the echo enables the altitudinal dependency of water vapor content to be measured.
[0180] The observation models 431 representing relationships between such two types of observation data and state variables will be described using
[0181] In the present example embodiment, with regard to the GPS precipitable water data OBS.sub.1, a value at an observation grid point can be associated with an integrated value of values at two calculation grid points having the same coordinate values in the xy-plane and different z (vertical) coordinate values. In
[0182] Next, with regard to the acoustic radar data OBS.sub.2, a value at an observation data collection grid point can be associated with the average of values at four calculation grid points that have the same z (vertical) coordinate value, that is, that are included in an identical plane. In
[0183] The simulation device 400 configured as described above operates in substantially the same manner as the simulation device 200 as the second example embodiment of the present invention.
[0184] In other words, the weather model 241 calculates prior distributions of the state vector at the next time step, which is calculated on the basis of a weather value initial state and terrain parameters (steps S201 and S202 in
[0185] A posterior distribution unifying unit 250 and a determining unit 51, with respect to the grid points 3 and 7 at which the second posterior distributions is created, create a unified posterior distribution, and determine which one of the created unified posterior distribution and the original second posterior distribution is to be used (steps S300 to S303). The weather model 421, using the state vector generated from posterior distributions, each of which is a first posterior distribution or a determined posterior distribution, with respect to the respective grid points, continues simulation. When a predefined time is reached (Yes in step S304), an output unit 60 outputs a time series of the state vector and finishes the operation.
[0186] As described above, the simulation device as the fourth example embodiment of the present invention is applicable to even a case in which observation data cannot be associated with grid points simply in a one-to-one manner and a simulation in a three-dimensional space. Even in such a case, the present example embodiment may, by using appropriate observation models, perform a high-resolution and high-accuracy simulation over a wide range taking into consideration non-ideal observation data and observation data that have a discontinuity or peculiarity.
[0187] In each of the above-described example embodiments of the present invention, the description is made mainly on an example in which the respective functional blocks of a simulation device are achieved by a CPU that executes a computer program stored in a storage device or a ROM. Without being limited to the above example, a portion or all of the functional blocks or a combination thereof may be achieved by dedicated hardware.
[0188] In each of the above-described example embodiments of the present invention, the functional blocks of a simulation device may be achieved in a distributed manner to a plurality of devices.
[0189] In each of the above-described example embodiments of the present invention, an operation of a simulation device that is described with reference to a flowchart may be stored in a storage device (storage medium) as a computer program of the present invention. Such a computer program may be configured to be read and executed by the CPU of the simulation device. In such a case, the present invention is configured as a code of such a computer program or a storage medium storing the computer program.
[0190] The above-described example embodiments may be embodied appropriately combined with one another.
[0191] The present invention is described using the above example embodiments thereof as typical examples. However, the present invention is not limited to the above example embodiments. That is, various modes that can be understood by a person skilled in the art may be applied to the present invention within the scope of the present invention.
[0192] This application claims priority based on Japanese Patent Application No. 2014-172371, filed on Aug. 27, 2014, the entire disclosure of which is incorporated herein by reference.
REFERENCE SIGNS LIST
[0193] 100, 200, 300, 400 Simulation device
[0194] 10 Input unit
[0195] 21 System model
[0196] 221 Soil model
[0197] 321 Crop model
[0198] 421 Weather model
[0199] 22 Prior distribution storage unit
[0200] 30 Data selection processing unit
[0201] 31, 231, 331, 431 Observation model
[0202] 40 Posterior distribution creating unit
[0203] 41a First posterior distribution storage unit
[0204] 41b Second posterior distribution storage unit
[0205] 50, 250, 350 Posterior distribution unifying unit
[0206] 51 Determining unit
[0207] 52 Unified posterior distribution storage unit
[0208] 60 Output unit
[0209] 1001 CPU
[0210] 1002 RAM
[0211] 1003 ROM
[0212] 1004 Storage device
[0213] 1005 Input device
[0214] 1006 Output device