SPATIAL DATA UPSAMPLING METHOD, SPATIAL DATA UPSAMPLING APPARATUS AND PROGRAM

20220405872 · 2022-12-22

Assignee

Inventors

Cpc classification

International classification

Abstract

A spatial data downscaling method executed by a computer including a memory and a processor, includes acquiring point data where a point in a geographical space and a value at the point are associated with each other and region data in which a region in the geographical space and a value in the region are associated with each other as training data; estimating, with the training data acquired in the acquiring, parameters of a multivariate Gaussian process model represented by a linear mixture of a plurality of latent Gaussian processes; and calculating resolution enhance data in which a region having a finer granularity than the region and a value in the region having the finer granularity are associated with each other from the region data designated by a user with the multivariate Gaussian process model in which the parameters estimated in the estimating have been set.

Claims

1. A spatial data downscaling method executed by a computer including a memory and a processor, the method comprising: acquiring point data where a point in a geographical space and a value at the point are associated with each other and region data in which a region in the geographical space and a value in the region are associated with each other as training data; estimating, with the training data acquired in the acquiring, parameters of a multivariate Gaussian process model represented by a linear mixture of a plurality of latent Gaussian processes; and calculating resolution enhance data in which a region having a finer granularity than the region and a value in the region having the finer granularity are associated with each other from the region data designated by a user with the multivariate Gaussian process model in which the parameters estimated in the estimating have been set.

2. The spatial data downscaling method according to claim 1, wherein the estimating includes estimating a spatial scale parameter, a mixing coefficient, a residual variance parameter, and a noise variance parameter as the parameters of the multivariate Gaussian process model.

3. The spatial data downscaling method according to claim 1, wherein the estimating comprises representing a value of the region data using a realization value of a Gaussian distribution having an integrated value of a Gaussian process in the region as an average based on the multivariate Gaussian process model, representing a value of the point data using a realization value of a Gaussian distribution having a value of a Gaussian process at the point as an average, and estimating the parameters by maximum likelihood estimation.

4. The spatial data downscaling method according to claim 1, wherein the geographical space includes a plurality of geographical spaces representing a plurality of different cities.

5. A spatial data downscaling apparatus comprising: a memory; and a processor configured to execute acquiring point data where a point in a geographical space and a value at the point are associated with each other and region data in which a region in the geographical space and a value in the region are associated with each other as training data; estimating, with the training data acquired in the acquiring, parameters of a multivariate Gaussian process model represented by a linear mixture of a plurality of latent Gaussian processes; and calculating resolution enhance data in which a region having a finer granularity than the region and a value in the region having the finer granularity are associated with each other from the region data designated by a user with the multivariate Gaussian process model in which the parameters estimated in the estimating have been set.

6. A non-transitory computer-readable recording medium having computer-readable instructions stored thereon, which when executed, cause a computer including a memory and a processor to execute a method comprising: acquiring point data where a point in a geographical space and a value at the point are associated with each other and region data in which a region in the geographical space and a value in the region are associated with each other as training data; estimating, with the training data acquired in the acquiring, parameters of a multivariate Gaussian process model represented by a linear mixture of a plurality of latent Gaussian processes; and calculating resolution enhance data in which a region having a finer granularity than the region and a value in the region having the finer granularity are associated with each other from the region data designated by a user with the multivariate Gaussian process model in which the parameters estimated in the estimating have been set.

Description

BRIEF DESCRIPTION OF DRAWINGS

[0015] FIG. 1 is a diagram illustrating an example of an overall configuration of a spatial data downscaling apparatus according to the present embodiment.

[0016] FIG. 2 is a diagram illustrating an example of a hardware configuration of a spatial data downscaling apparatus according to the present embodiment.

[0017] FIG. 3 is a flowchart illustrating an example of parameter estimation processing according to the present embodiment.

[0018] FIG. 4 is a flowchart illustrating an example of spatial data resolution enhancing processing according to the present embodiment.

[0019] FIG. 5 is a diagram illustrating an example of a spatial data resolution enhance screen.

DESCRIPTION OF EMBODIMENTS

[0020] Hereinafter, embodiments of the present disclosure will be described. In the present embodiment, a spatial data downscaling apparatus 10 capable of enhancing resolution of spatial data with high accuracy will be described.

[0021] Here, the spatial data downscaling apparatus 10 according to the present embodiment can estimate unknown quantities (parameters) of a multivariate Gaussian process model that takes a spatial correlation between a point and a point, between a point and a region, and between a region and a region into account when spatial data including point data and region data is given. Thus, the spatial data downscaling apparatus 10 according to the present embodiment can enhance the resolution of spatial data of a target with high accuracy using the multivariate Gaussian process model in which the estimated parameters have been used.

[0022] Further, the spatial data downscaling apparatus 10 according to the present embodiment can estimate the parameters of the multivariate Gaussian process model even when spatial data in different spaces (for example, spatial data in a plurality of cities) is given. Thus, the spatial data downscaling apparatus 10 according to the present embodiment can utilize spatial data of other cities to estimate the parameters, for example, even when the number of types of spatial data in a certain city is small.

[0023] As described above, the spatial data refers to data represented by a pair of position information (for example, latitude and longitude, an address, and an area) and some value associated with the position information. Further, when the position information is a point such as latitude and longitude (that is, when the spatial data is represented by a pair of a point and some value associated with the point), the spatial data is also represented as point data. On the other hand, when the position information is any region with a geospatial extent (that is, when the spatial data is represented by a pair of a region and some value associated with the region), the spatial data is also referred to as region data. The region data can also be said to be spatial data in which some values are integrated in any region having a certain geospatial extent.

[0024] Further, enhancing the resolution of spatial data means, with region data represented by a pair of a certain granularity region and a value associated with the region, calculating the spatial data represented by a pair of a finer granularity region and a value associated with the region. For example, enhancing the resolution of the spatial data means, with region data representing a population in a certain prefecture. calculating region data representing a population in each city in the prefecture.

[0025] In the following embodiment, a case in which the parameters of the multivariate Gaussian process model are estimated mainly assuming that a plurality of types of point data and region data in a certain city are given, and the resolution of a target type of region data is enhanced will be described. Further, in this description, a case in which the parameters of the multivariate Gaussian process model are estimated when a plurality of types of point data and region data in a plurality of cities are given will also be described. The type of spatial data (point data and region data) is a type of information represented by the value associated with the point or region, and examples thereof include a degree of poverty, a degree of air pollution, a number of crimes, a population, and a traffic volume.

[0026] Overall Configuration

[0027] First, an overall configuration of the spatial data downscaling apparatus 10 according to the present embodiment will be described with reference to FIG. 1. FIG. 1 is a diagram illustrating an example of the overall configuration of the spatial data downscaling apparatus 10 according to the present embodiment.

[0028] As illustrated in FIG. 1, the spatial data downscaling apparatus 10 according to the present embodiment includes a resolution enhancing processing unit 101, an acquisition unit 102, an operation reception unit 103, and an output unit 104. Further, the spatial data downscaling apparatus 10 according to the present embodiment includes a point data storage unit 111, a region data storage unit 112, a parameter storage unit 113, and a target division storage unit 114.

[0029] The point data storage unit 111 stores a plurality of items of point data. The point data stored in the point data storage unit 111 includes a plurality of types of point data.

[0030] Here, a set representing an entire input space is set as X ⊂ R (R: the set of all real numbers) and x ∈ X is set as an input variable. For example, X corresponds to the entire city and x corresponds to latitude and longitude. s=1, . . . , S.sub.0 is an argument indicating a type of point data, and n=1, . . . , N.sub.s is an argument indicating the number of items of point data of type s (that is, the number of sample points). Further, an n-th sample point of the point data of the type s is x.sub.s,n, and the n-th point data of the type s is represented as a set (x.sub.s,n, y.sub.s,n) of the sample point x.sub.s,n and the value y.sub.s,n ∈ R. Thus, the point data stored in the point data storage unit 111 is {(x.sub.s,n, y.sub.s,n)|s=1, . . . , S.sub.0; n=1, . . . , N.sub.s}. (x.sub.s,n, y.sub.s,n) means that an n-th observation y.sub.s,n of the type s has been obtained at the point x.sub.s,n.

[0031] The region data storage unit 112 stores a plurality of items of region data. A plurality of types of region data are included in the region data stored in the region data storage unit 112.

[0032] Here, s=S.sub.0+1, . . . , S is an argument indicating a type of region data, and P.sub.s is an argument indicating division of the region data of type s. The division is a division when a specific geographical space is divided into a plurality of geographical regions, and is, for example, a division of a city by an address or area. Further, |P.sub.s| indicates the number of regions included in the division P.sub.s. An n-th region is R.sub.s,n ∈ P.sub.s for an argument n=1, . . . , |P.sub.s| indicating the region. Further, the n-th region data of the type s is represented by a set (R.sub.s,n, y.sub.s,n) of the region R.sub.s,n and the value y.sub.s,n ∈ R. Thus, the region data stored in the region data storage unit 112 is {(R.sub.s,n, y.sub.s,n)|s=S.sub.0+1, . . . , S; n=1, . . . , |P.sub.s|}. (R.sub.s,n, y.sub.s,n) means that an n-th observation y.sub.s,n of the type s has been obtained in the region R.sub.s,n.

[0033] The parameter storage unit 113 stores parameters (parameters of the multivariate Gaussian process model) estimated by a parameter estimation unit 105 that will be described below. That is, trained parameters of the multivariate Gaussian process model are stored in the parameter storage unit 113. As will be described below, the parameters to be estimated are a spatial scale parameter, a mixing coefficient, a residual variance parameter, and a noise variance parameter.

[0034] The target division storage unit 114 stores a target division P.sup.target representing division after the enhance in the resolution. Further, one region included in the target division P.sup.target is represented R.sup.target. Here, any target division P.sup.target can be used. For example, it is conceivable that the target division P.sup.target is, for example, division by an address, area, or the like, or division by a mesh having a size randomly set by a user.

[0035] The resolution enhancing processing unit 101 performs estimation of the parameters of the multivariate Gaussian process model and calculation of resolution enhance data obtained by enhancing the resolution of a target type of region data.

[0036] The acquisition unit 102 acquires the point data and the region data from the point data storage unit 111 and the region data storage unit 112 as training data that is used for parameter estimation of the multivariate Gaussian process model.

[0037] The operation reception unit 103 receives a user operation for designating a target type of region data and a target division (that is, the target division P.sup.target). A resolution of the target type of region data (that is, a granularity of the region of each item of region data of the type) is coarser than that of the target division P.sup.target.

[0038] The output unit 104 outputs resolution enhance data to a predetermined output destination. The output destination can be any output destination; as examples, display on a display, printing with a printer, sound wave output from a speaker, transmission to an external device connected via a communication network, or the like can be considered.

[0039] Here, the resolution enhancing processing unit 101 includes the parameter estimation unit 105 and a resolution enhance data calculation unit 106.

[0040] The parameter estimation unit 105 estimates, with the point data and the region data acquired by the acquisition unit 102, the parameters (the spatial scale parameter, the mixing coefficient, the residual variance parameter, and the noise variance parameter) of the multivariate Gaussian process model. Hereinafter, the multivariate Gaussian process model and a method for estimating the parameters thereof will be described.

[0041] First, formulation of a multivariate Gaussian process model represented by a linear mixture of a plurality of latent Gaussian processes is performed. L independent Gaussian processes are defined as:


[Math. 1]


g.sub.l(xcustom-character(0, γ.sub.l(x, x′)), l=1, . . . , L   (1)

Here, γ.sub.l(x, x′): X×X.fwdarw.R is a correlation function of a first Gaussian process, and any function can be used. In the present embodiment,

[00001] [ Math . 2 ] γ l ( x , x ) = exp ( - 1 2 β l 2 .Math. x - x .Math. 2 ) ( 2 )

is used as this correlation function. Here, β.sub.l is a spatial scale parameter of the first correlation function. A total number of types of point data and region data (in other words, the number of data sets for each type) is S. For s=1, . . . , S.sub.0, S.sub.0+1, . . . , S, f.sub.s(x) is a noiseless latent Gaussian process for s-th spatial data (that is, the point data of the type s when s=1, . . . , S.sub.0, and the region data of the type s when s=S.sub.0+1, . . . , S). An S variate Gaussian process


f(x)=(f.sub.l(x), . . . , f.sub.S(x)).sup.τ  [Math. 3]

is a linear mixture of L independent Gaussian processes, and is represented as


[Math. 4]


f(x)=Wg(x)+n(x)   (3)


Here,


g(x)=(g.sub.l(x), . . . , g.sub.L(x)).sup.τ  [Math. 5]

is represented, W is a mixed matrix of S×L, and w.sub.s,l ∈ R that is an (s, l) element represents a mixing coefficient. Further, n(x) is a Gaussian process having an average 0 of an S variate, and is represented as


[Math. 6]


n(x)˜custom-character(0, Λ(x, x′))   (4)


Here,


0   [Math. 7]

is an S dimensional vector with all elements of 0, and Λ(x, x′) is


[Math. 8]


Λ(x, x′)=diag(λ.sub.1(x, x′), . . . , λ.sub.S(x, x′))   (5)

λ.sub.s(x, x′): X×X.fwdarw.R is a correlation function for the s-th spatial data, and any function can be used. For simplicity in the present embodiment, with a Dirac delta function δ(⋅)


[Math. 9]


λ.sub.s(x, x′)=λ.sub.s.sup.2δ(x−x′)   (6)

is obtained. λ.sub.s.sup.2 is a residual variance parameter for an s-th Gaussian process. g(x) in Equation (3) above can be integrated out, and as a result, the S variate Gaussian process can be written as


[Math. 10]


f(x)˜custom-character(0, K(x, x′))   (7)

Here, K(x, x′): X×X.fwdarw.R.sup.S×S represents a correlation matrix.


[Math. 11]


K(x, x′)=WΓ(x, x′)W.sup.τ+Λ(x, x′)   (8)

is obtained. Here, Γ(x, x′)=diag(γ.sub.l(x, x′), . . . , γ.sub.L(x, x′)). Further, a (s, s′) element of K(x, x′) is given as

[00002] [ Math . 12 ] k s , s ( x , x ) = δ s , s λ s ( x , x ) + .Math. l = 1 L 𝓌 s , I w s , I γ I ( x , x ) . ( 9 )

Here, δ⋅,⋅ is a Kronecker delta function, and δ.sub.A, B=1 is output when A=B, and δ.sub.A, B=0 is output otherwise.

[0042] Then, a value of the region data is represented by a realization value of the Gaussian distribution having an integrated value of the Gaussian process in the region (that is, the region associated with this value) as an average, and a value of the point data is represented by the realization value of the Gaussian distribution having a value of the Gaussian process at the point (that is, a point associated with this value) as an average. An N.sub.s dimensional observation vector generated from the s-th Gaussian process for s=1, . . . , S.sub.0 is


y.sub.s=(y.sub.s,l, . . . , y.sub.s,N.sub.s).sup.τ.   [Math. 13]

Further, a |P.sub.s| dimensional observation vector generated from the s-th Gaussian process for s=S.sub.0+1, . . . , S is


y.sub.s=(y.sub.s,l, . . . , y.sub.s, custom-character.sub.s|).sup.τ.   [Math. 14]

Observation vectors generated from S Gaussian processes are collectively represented as

[00003] [ Math . 15 ] y = ( y 1 .Math. y S 0 y S 0 + 1 .Math. y S ) . ( 10 )

It is assumed that y follows a multidimensional Gaussian distribution


[Math. 16]


y|f(x)˜custom-character(y|∫.sub.xA(x)f(x)dx,Σ)   (11)

Here,

[0043]
N=Σ.sub.s=1.sup.S.sup.0N.sub.s+Σ.sub.s=S.sub.0.sub.+1|custom-character|,   [Math. 17]

and A(x): x.fwdarw.R.sup.N×S is


[Math. 18]


A(x)=diag(a.sub.1(x), . . . , a.sub.S.sub.0(x), a.sub.S.sub.0.sub.+1(x), . . . , a.sub.S(x))   (12)

When s=1, . . . , S.sub.0,


a.sub.s(x)=(a.sub.s,1(x), . . . , a.sub.s,N.sub.s(x)).sup.τ  [Math. 19]

and when s=S.sub.0+1, . . . , S,


a.sub.s(x)=(a.sub.s,1(x), . . . , custom-character(x)).sup.τ.   [Math. 20]

Any a.sub.s,n(x) can be used, and a method of integrating in each region can be changed using a method of setting a.sub.s,n(x). In the present embodiment, a case in which an observation at a point is obtained when s=1, . . . , S.sub.0, and a result of region-averaging in each region R.sub.s,n is obtained as an observation when s=S.sub.0+1, . . . , S is considered. In this case, a.sub.s,n(x) can be written as

[00004] [ Math . 21 ] a s , n ( x ) = { ( x = x s , n ) if s = 1 , .Math. , S 0 ( x s , n ) 𝒳 ( x s , n ) dx if s = S 0 + 1 , .Math. , S . ( 13 )

Here,

[0044]
custom-character(.circle-solid.)   [Math. 22]

is an indicator function, and when C is true,


custom-character(C)=1   [Math. 23]

is output, and otherwise,


custom-character(C)=0   [Math. 24]

is output. Further,

[00005] [ Math . 25 ] .Math. = ( σ 1 2 I O O σ 2 2 I .Math. .Math. O O .Math. .Math. .Math. O O .Math. σ S 2 I ) , ( 14 )

and σ.sub.s.sup.2 is a noise variance parameter of the s-th Gaussian process. Here, I is an identity matrix, and O is a matrix having 0 as elements. Parameters to be estimated by the parameter estimation unit 105 are a spatial scale parameter β={β.sub.l|l=1, . . . , L}, a mixing matrix W (that is, a mixing coefficient {w.sub.s,l,|s=1, . . . , S, l=1, . . . , L}) which is an element thereof), a residual variance parameter Λ={λ.sub.s|s=1, . . . , S}, and a noise variance parameter Σ.

[0045] Next, a method of training (estimating) various parameters (the spatial scale parameter, the mixing coefficient, the residual variance parameter, and the noise variance parameter) using maximum likelihood estimation will be described. When the observation y is given, by integrating out f(x), a marginal likelihood can be written as


[Math. 26]


p(y)=custom-character(y|0, C)   (15)

Here, C is an N×N correlation matrix, and can be written as

[00006] [ Math . 40 ] C = 𝒳 × 𝒳 A ( x ) K ( x , x ) A ( x ) T dxdx + .Math. ( 16 ) = ( C 1 , 1 C 1 , 2 .Math. C l , S C 2 , 1 C 2 , 2 .Math. C 2 , S .Math. .Math. .Math. C S , 1 C S , 2 .Math. C S , S ) . ( 17 )

C.SUB.s,s′ .is

[0046]
[Math. 28]


C.sub.s,s′=∫∫.sub.x×xk.sub.s,s′(x, x′)a.sub.s(x)a.sub.s′(x′).sup.τdxdx′+δ.sub.s,s′σ.sub.sI   (18)

Because it is difficult to analytically calculate a regional integration in Equation (18) above, discrete approximation is performed to calculate the regional integration in the present embodiment. First, an input space X is divided into sufficiently fine grids, and a set of grid points included in the regions R.sub.s,n is set as


custom-character.sub.s,n.   [Math. 29]

Thus, each component of C.sub.s,s′ can be approximated to

[00007] [ Math . 30 ] C s , s ( n , n ) { k s , s ( x s , n , x s , n ) + δ s , s σ s if s = 1 , .Math. , S 0 ; s = 1 , .Math. , S 0 1 .Math. "\[LeftBracketingBar]" 𝒢 s , n .Math. "\[RightBracketingBar]" .Math. j 𝒢s , n k s , s ( x s , n , j ) if s = 1 , .Math. , S 0 ; s = S 0 + 1 , .Math. , S 1 .Math. "\[LeftBracketingBar]" 𝒢 s , n .Math. "\[RightBracketingBar]" .Math. i 𝒢s , n k s , s ( i , x s , n ) if s = S 0 + 1 , .Math. , S ; s = 1 , .Math. , S 0 1 .Math. "\[LeftBracketingBar]" 𝒢 s , n .Math. "\[RightBracketingBar]" 1 .Math. "\[LeftBracketingBar]" 𝒢 s , n .Math. "\[RightBracketingBar]" .Math. i 𝒢 s , n .Math. j 𝒢s , n k s , s ( i , j ) + δ s , s σ s if s = S 0 + 1 , .Math. , S ; s = S 0 + 1 , .Math. , S . ( 19 )

The first row of Equation (19) above represents “covariance between a point and a point,” the second and third rows represent “covariance between a point and a region,” and the fourth row represents “covariance between a region and a region.” By taking a logarithm of the marginal likelihood shown Equation (15) to extract only terms regarding the parameters to be estimated,

[00008] [ Math . 31 ] log p ( y | W , β , Λ , Σ ) - 1 2 y T C - 1 y - 1 2 log ( det ( C ) ) ( 20 )

is obtained. By maximizing Equation (20) above, maximum likelihood estimation solutions for various parameters can be obtained. An optimization problem for maximizing Equation (20) above can be solved by using, for example, a Broyden-Fletcher-Goldfarb-Shanno (BFGS) method. For the BFGS method, for example, a reference “D. C. Liu and J. Nocedal. On the limited memory BFGS method for large scale optimization. Mathematical Programming, 45 (1-3): 503-528, 1989,” can be referred to.

[0047] A parameter estimation method in a case in which a plurality of types of point data and region data in a plurality of cities are given will be described here. Hereafter, it is assumed that there are V types of cities. In this case, it is assumed that the point data and the region data in each city follow a conditionally independent probability distribution when a common latent Gaussian process {g.sub.l(x)|l=1, . . . , L} and the mixing matrix W are given. Thus, the marginal likelihood when a plurality of types of point data and region data in a plurality of cities are given can be written as

[00009] [ Math . 32 ] p ( y ( 1 ) , y ( 2 ) , .Math. , y ( V ) ) = .Math. v = 1 V 𝒩 ( y ( v .Math. | 0 , C ( v ) ) . ( 21 )

Here, y.sup.(v) is an observation vector in a city of type v, and C.sup.(v) is a correlation matrix for the city of type v. By maximizing Equation (21) above in substantially the same procedure as in the case of one city, maximum likelihood estimation solutions for various parameters can be obtained.

[0048] Further, the resolution enhance data calculation unit 106, with the type of region data designated by the user operation received by the operation reception unit 103 and the target division P.sup.target, calculates the resolution enhance data obtained by enhancing the resolution of the region data of this type into the target division P.sup.target. Hereinafter, a method of calculating the resolution enhance data will be described.

[0049] First, a post-process of the S variate Gaussian process f(x) is derived, in which f(x) is the S variate Gaussian process in which various parameters estimated by the parameter estimation unit 105 have been set. This post-process f*(x) can be written as


[Math. 33]


f*(x)˜custom-character(m*(x), K*(x, x′))   (22)

Here, m*: X.fwdarw.R.sup.S represents an average vector, and K*(x, x′): X×X.fwdarw.R.sup.S×S represents a correlation matrix.

[0050] Further, H(x): X.fwdarw.R.sup.N×S is put as

[00010] [ Math . 34 ] H ( x ) = X A ( x ) K ( x , x ) dx ( 23 ) = ( h 1 , 1 ( x ) h 1 , 2 ( x ) h 2 , 1 ( x ) h 2 , 2 ( x ) .Math. .Math. h S , 1 ( x ) h S , 2 ( x ) .Math. .Math. .Math. h 1 , S ( x ) h 2 , S ( x ) .Math. h S , S ( x ) ) . ( 24 )

Here,

[0051]
[Math. 35]


h.sub.s,s′(x)=∫.sub.xa.sub.s(x′)k.sub.s,s′(x′, x)dx′  (25)

Because it is difficult to analytically calculate a regional integration in Equation (25) above, a discrete approximation is performed as in Equation (19) above. Thus, a set of grid points, with


custom-character.sub.s,n   [Math. 36]

an n-th element of h.sub.s,s′(x) can be calculated as

[00011] [ Math . 37 ] { k s , s ( x s , n , x ) if s = 1 , .Math. , S 0 1 .Math. "\[LeftBracketingBar]" s , n .Math. "\[RightBracketingBar]" .Math. i s , n k s , s ( i , x ) if s = S 0 + 1 , .Math. , S . ( 26 )

Thus, by using H(x), m*(x) and K*(x, x′) are expressed as


[Math. 38]


m*(x)=m(x)+H(x).sup.τC.sup.−1(y−μ),   (27)


K*(x, x′)=K(x, x′)−H(x).sup.τC.sup.−1H(x)   (28)

[0052] In this case, the resolution enhance data to be calculated is obtained by integrating a posterior average shown in Equation (27) above in each region in the target division P.sup.target. Now consider calculating a predicted value in the region R.sup.target included in the target division P.sup.target (that is, a value associated with this region R.sup.target) with an argument indicating the type (that is, the target type of region data) designated by the user operation received by the operation reception unit 103 as s. Denoting an s-th element of a posterior average m*(x) as m.sub.s*(x), a predicted value to be obtained is

[00012] [ Math . 39 ] χ a t a r g e t ( x ) m s * ( x ) dx .Math. i 𝒢 t a r g e t a t a r g e t ( i ) m s * ( i ) ( 29 ) 1 | 𝒢 t a r g e t | .Math. i 𝒢 t a r g e t m s * ( i ) . ( 30 )

Here, a.sup.target(x) is

[00013] [ Math . 40 ] a t a r g e t ( x ) = ( x t a r g e t ) 𝒳 ( x t a r g e t ) dx . ( 31 )

For the regional integration in Equation (29) above, discrete approximation is performed, and a set of grid points included in the region R.sup.target is obtained as


custom-character.sup.target   [Math. 41]

By repeating the calculation process of Equations (29) to (31) above for each region R.sup.target included in the target division P.sup.target, desired resolution enhance data can be obtained.

[0053] Hardware Configuration

[0054] Next, a hardware configuration of the spatial data downscaling apparatus 10 according to the present embodiment will be described with reference to FIG. 2. FIG. 2 is a diagram illustrating an example of the hardware configuration of the spatial data downscaling apparatus 10 according to the present embodiment.

[0055] As illustrated in FIG. 2, the spatial data downscaling apparatus 10 according to the present embodiment is implemented by a general computer or a computer system, and includes an input device 201, a display device 202, an external I/F 203, a communication I/F 204, a processor 205, and a memory device 206. These hardware components are communicably connected via a bus 207.

[0056] The input device 201 is, for example, a keyboard, a mouse, or a touch panel. The display device 202 is, for example, a display. The spatial data downscaling apparatus 10 may or may not include at least one of the input device 201 and the display device 202.

[0057] The external I/F 203 is an interface with an external device. Examples of the external device include a recording medium 203a and the like. The spatial data downscaling apparatus 10 can perform reading from or writing to the recording medium 203a via the external I/F 203. One or more programs that implement respective functional units (the resolution enhancing processing unit 101, the acquisition unit 102, the operation reception unit 103, the output unit 104, and the like) included in the spatial data downscaling apparatus 10, for example, may be stored in the recording medium 203a.

[0058] Examples of the recording medium 203a include a compact disc (CD), a digital versatile disk (DVD), a Secure Digital memory card (SD memory card), and a Universal Serial Bus (USB) memory card.

[0059] The communication I/F 204 is an interface for connecting the spatial data downscaling apparatus 10 to a communication network. One or more programs that implement respective functional units of the spatial data downscaling apparatus 10 may be acquired (downloaded) from a predetermined server apparatus or the like via the communication I/F 204.

[0060] The processor 205 is, for example, various calculation devices such as a central processing unit (CPU) or a graphics processing unit (GPU). Each functional unit included in the spatial data downscaling apparatus 10 is implemented by one or more programs stored in the memory device 206 or the like causing the processor 205 to execute processing.

[0061] The memory device 206 is, for example, any storage device such as a hard disk drive (HDD), a solid state drive (SSD), a random access memory (RAM), a read only memory (ROM), or a flash memory. Each storage unit (the point data storage unit 111, the region data storage unit 112, the parameter storage unit 113, the target division storage unit 114, and the like) included in the spatial data downscaling apparatus 10 can be implemented by using the memory device 206. For example, at least one of the point data storage unit 111, the region data storage unit 112, and the target division storage unit 114 may be implemented by using, for example, a storage device (for example, a database server) connected to the spatial data downscaling apparatus 10 via a communication network.

[0062] The spatial data downscaling apparatus 10 according to the present embodiment can implement parameter estimation processing and spatial data resolution enhancing processing to be described below by having the hardware configuration illustrated in FIG. 2. The hardware configuration illustrated in FIG. 2 is an example, and the spatial data downscaling apparatus 10 may have another hardware configuration. For example, the spatial data downscaling apparatus 10 may include a plurality of processors 205 or may include a plurality of memory devices 206.

[0063] Flow of Parameter Estimation Processing

[0064] Next, a flow of parameter estimation processing for estimating the parameters of the multivariate Gaussian process model will be described with reference to FIG. 3. FIG. 3 is a flowchart illustrating an example of the parameter estimation processing according to the present embodiment.

[0065] First, the acquisition unit 102 acquires the point data and the region data as training data from the point data storage unit 111 and the region data storage unit 112 (step S101).

[0066] Next, the parameter estimation unit 105 of the resolution enhancing processing unit 101 maximizes the marginal likelihood shown in Equation (15) above with the training data acquired in step S101 as the observation y to estimate the parameters of the S variate Gaussian process model (step S102). If a plurality of types of point data and region data in a plurality of cities are acquired as training data as described above, the parameter estimation unit 105 of the resolution enhancing processing unit 101 maximizes the marginal likelihood shown in Equation (21) above to estimate the parameters of the S variate Gaussian process model.

[0067] The parameter estimation unit 105 of the resolution enhancing processing unit 101 stores various parameters estimated (trained) in step S102 above in the parameter storage unit 113 (step S103).

[0068] Flow of Spatial Data Resolution Enhance Processing

[0069] Next, a flow of the spatial data resolution enhancing processing for calculating the resolution enhance data to be obtained by enhancing the resolution of the target type of region data will be described with reference to FIG. 4. FIG. 4 is a flowchart illustrating an example of the spatial data resolution enhancing processing according to the present embodiment.

[0070] First, the operation reception unit 103 receives a user operation for designating the type s of target region data and the target division P.sup.target (step S201). Here, for example, the user can input or select a desired type s and a desired target division P.sup.target in a region data type designation field 1001 and a target division designation field 1002 included in a spatial data resolution enhance screen 1000 illustrated in FIG. 5 to designate the division. In the region data display field 1003 included in the spatial data resolution enhance screen 1000 illustrated in FIG. 5, for example, a region included in each item of region data of the type s designated in the region data type designation field 1001 is displayed using a color (or a shade of a color, or the like) according to the value associated with the region.

[0071] Next, the resolution enhance data calculation unit 106 of the resolution enhancing processing unit 101, with the type s designated in step S201 and the target division P.sup.target, calculates the resolution enhance data with the S variate Gaussian process model in which the parameters (i.e., trained parameters) stored in the parameter storage unit 113 have been set (step S202). That is, the resolution enhance data calculation unit 106 calculates the resolution enhance data obtained by enhancing the resolution of the region data of the type s into the region R.sup.target included in the target division P.sup.target using Equations (29) to (31) above.

[0072] The output unit 104 outputs the resolution enhance data calculated in step S202 above (step S203). For example, the output unit 104 displays a region included in the resolution enhance data calculated in step S202 above in the resolution enhance data display field 1004 included in the spatial data resolution enhance screen 1000 illustrated in FIG. 5 using color (or shade of color, or the like) according to the value (predicted value) associated with the region. Thus, it is possible to visualize the target type of region data and the resolution enhance data obtained by enhancing the resolution of this region data, and the user can narrow down, for example, regions with a high degree of poverty or regions with a higher degree of air pollution in detail and take more detailed measures.

[0073] Conclusion

[0074] As described above, the spatial data downscaling apparatus 10 according to the present embodiment estimates the parameters of the multivariate Gaussian process model that takes the spatial correlation between a point and a point, between a point and a region, and between a region and a region into account when spatial data including point data and region data is given. That is, the spatial data downscaling apparatus 10 according to the present embodiment sets the “spatial scale parameter,” the “mixing coefficient,” the “residual variance parameter,” and the “noise variance parameter” as unknown quantities, and by using the maximum likelihood estimation, estimates the unknown quantities of the Gaussian process model from the region data and the point data. This estimation is performed by representing the value of the region data using the realization value of the Gaussian distribution having the integrated value of the Gaussian process in the region as an average based on a multivariate Gaussian process model represented by a linear combination of a plurality of latent Gaussian processes, and representing the value of the point data using the realization value of the Gaussian distribution having the value of the Gaussian process at the point as an average.

[0075] Thus, the spatial data downscaling apparatus 10 according to the present embodiment brings effects shown in the following (1) to (3).

[0076] (1) By modeling a plurality of items of spatial data at the same time based on the multivariate Gaussian process model represented by the linear combination of a plurality of latent Gaussian processes, “spatial scale parameters” can be shared among the plurality of items of spatial data and trained. Thus, even if there is low-resolution spatial data, a plurality of items of spatial data can be effectively utilized to calculate (predict) the resolution enhance data.

[0077] (2) If there is spatial data in a plurality of cities (that is, a plurality of entire spaces), by modeling a plurality of items of spatial data at the same time based on the multivariate Gaussian process model represented by the linear combination of a plurality of latent Gaussian processes, “spatial scale parameters” and the “mixing coefficient” can be shared among the plurality of cities and the plurality of items of spatial data and trained. Thus, even if the number of types of spatial data in a certain city is small, spatial data of other cities is effectively utilized to calculate the resolution enhance data.

[0078] (3) By representing the value of the region data using the realization value of the Gaussian distribution having the integrated value of the Gaussian process in the region as an average, and representing the value of the point data using the realization value of the Gaussian distribution having the value of the Gaussian process at the point as an average, the spatial correlation between a point and a point, a point and a region, and a region and a region can be accurately evaluated while regions having various sizes or shapes are taken into account, and estimation of unknown quantities of the Gaussian process model can be accurately performed.

[0079] The present invention is not limited to the specific embodiment described above, and various modifications or changes, combinations with known technologies, and the like can be made without departing from the description of the claims.

REFERENCE SIGNS LIST

[0080] 10 Spatial data downscaling apparatus [0081] 101 Resolution enhancing processing unit [0082] 102 Acquisition unit [0083] 103 Operation reception unit [0084] 104 Output unit [0085] 105 Parameter estimation unit [0086] 106 Resolution enhance data calculation unit [0087] 111 Point data storage unit [0088] 112 Region data storage unit [0089] 113 Parameter storage unit [0090] 114 Target division storage unit