SPATIAL PREDICTION METHOD OF RICE STABLE ISOTOPE BASED ON ENVIRONMENTAL SIMILARITY
20230306076 · 2023-09-28
Assignee
Inventors
- Yuwei Yuan (Hangzhou, CN)
- Meiling Sheng (Hangzhou, CN)
- Chunlin Li (Hangzhou, CN)
- Yongzhi Zhang (Hangzhou, CN)
- Jing Nie (Hangzhou, CN)
- Shengzhi Shao (Hangzhou, CN)
Cpc classification
G06F17/12
PHYSICS
International classification
Abstract
A spatial prediction method of rice stable isotope based on environmental similarity is provided. The method comprises: describing environmental characteristics of the rice stable isotope; measuring an environmental similarity between a site to be predicted and a sample site; measuring a reliability of the sample site; and carrying out a spatial prediction on the rice stable isotope according to the environmental similarity between the site to be predicted and the sample site and the reliability of the sample site. This method can improve the accuracy of the prediction result.
Claims
1. A spatial prediction method of rice stable isotope based on environmental similarity, comprising: (1) describing environmental characteristics of the rice stable isotope, and screening factors that have great influence on the rice stable isotope as auxiliary variables in prediction process; (2) measuring an environmental similarity between a site to be predicted and a sample site, wherein a similarity for a single factor between the site to be predicted and the sample site is calculated, and similarity for each influencing factor is synthesized by using a weighted average method to obtain a value of the environmental similarity between the site to be predicted and the sample site; (3) measuring a reliability of the sample site, wherein the reliability of the sample site is calculated by using an environmental similarity between sample sites and a similarity for a target variable; and (4) carrying out a spatial prediction on the rice stable isotope according to the environmental similarity between the site to be predicted and the sample site and the reliability of the sample site.
2. The spatial prediction method according to claim 1, wherein step (2) comprises: calculating the environmental similarity for the single factor by using a Gower similarity calculation method, which is expressed as:
3. The spatial prediction method according to claim 2, wherein the environmental similarity between the site to be predicted and the sample site is synthesized by using the weighted average method, and a calculation formula is as follows:
4. The spatial prediction method according to claim 1, wherein in step (3), the environmental similarity between the sample sites and the similarity for the target variable are calculated, and a threshold parameter (p.sub.1) of the environmental similarity and a threshold parameter (p.sub.2) of the similarity for the target variable are set, and the relationship between the sample sites is determined.
5. The spatial prediction method according to claim 4, wherein as supporting sample sites increase, the reliability increases; as contradictory sample sites increase, the reliability decreases; if the sample site has only contradictory sample sites and no supporting sample site, the reliability of the sample site is 0; and if there is neither a supporting sample site nor a contradictory sample site, the reliability of the sample site is unknown and is set to a null value “NoData”.
6. The spatial prediction method according to claim 5, wherein a calculation formula of the reliability of the sample site is as follows:
7. The spatial prediction method according to claim 1, wherein in step (4), calculating a stable isotope value of the site to be predicted based on steps (2) and (3) comprises: calculating a value of each site to be predicted by using the weighted average method, and a formula is as follows:
8. The spatial prediction method according to claim 7, wherein a calculation formula of an uncertainty of the prediction is as follows:
U.sub.j=1−max(S.sub.j1×r.sub.1,S.sub.j2×r.sub.2, . . . ,S.sub.jn×r.sub.n), wherein n is the number of sample sites; S.sub.jn is the environmental similarity between the site to be predicted j and the n-th sample site; r.sub.n is the reliability of the n-th sample site; and as the similarity and the reliability increase, the uncertainty value of the prediction decreases.
9. The spatial prediction method according to claim 1, wherein the environmental similarity between the site to be predicted and the sample site is synthesized by using the weighted average method, and a calculation formula is as follows:
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0018]
[0019]
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0020] In order to make the content of the present disclosure more readily understandable, the content of the present disclosure will be described in detail in conjunction with specific embodiments and drawings hereinafter.
[0021] According to the third law of geography, “the more similar the geographic configurations of two points (areas), the more similar the values (processes) of the target variable at these two points (areas)”. It follows that the more similar the geographical environments of two sites are, the closer their rice stable isotope values are. Based on this theory, the spatial distribution of rice stable isotope is predicted according to the environmental similarity between sites.
[0022] As shown in
[0024] Correlation analysis is used to screen the factors that are highly correlated with the rice stable isotopes (δ.sup.13C, δ.sup.2H and δ.sup.18O) as auxiliary variables in the prediction process, forming an influencing factor database.
[0025] (2) Environmental similarity between a site to be predicted and a sample site is measured.
[0026] The similarity for each single factor between the site to be predicted and the sample site is calculated using a Gower similarity calculation method, and the similarities for all influencing factors are synthesized by using a weighted average method to obtain a value of the environmental similarity between the site to be predicted and the sample site.
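The two operations of step (2) can be sketched in Python as follows. This is a minimal illustration, not the formula of the disclosure: it assumes the standard Gower form for a numeric variable (one minus the absolute difference divided by the range of that variable) and a normalized weighted average. The function names, the clamping at zero, and the example values are hypothetical.

```python
from typing import Sequence


def gower_similarity(x_i: float, x_j: float, value_range: float) -> float:
    """Gower similarity for one numeric factor: 1 - |x_i - x_j| / range."""
    if value_range <= 0:
        # Assumption: a factor that never varies cannot separate two sites.
        return 1.0
    # Assumption: clamp at 0 in case a site lies outside the observed range.
    return max(0.0, 1.0 - abs(x_i - x_j) / value_range)


def environmental_similarity(
    site_i: Sequence[float],
    site_j: Sequence[float],
    ranges: Sequence[float],
    weights: Sequence[float],
) -> float:
    """Weighted average of the single-factor similarities of two sites."""
    if not (len(site_i) == len(site_j) == len(ranges) == len(weights)):
        raise ValueError("all inputs must have one entry per factor")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(
        w * gower_similarity(a, b, r)
        for a, b, r, w in zip(site_i, site_j, ranges, weights)
    )
    return weighted_sum / total_weight


# Made-up example: two factors (temperature in deg C, precipitation in mm).
similarity_equal = environmental_similarity(
    [16.0, 1200.0], [18.0, 1000.0], ranges=[10.0, 1000.0], weights=[1.0, 1.0]
)
similarity_weighted = environmental_similarity(
    [16.0, 1600.0], [18.0, 1000.0], ranges=[10.0, 1000.0], weights=[3.0, 1.0]
)
print(similarity_equal, similarity_weighted)
```

Dividing by the sum of the weights keeps the result in the interval from 0 to 1 whatever scale the weights are given in, which is convenient when the value is later multiplied by a reliability.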
[0027] The Gower similarity calculation method is expressed as:
[0029] The environmental similarity between the site to be predicted and the sample site is synthesized by using the weighted average method, and the calculation formula is as follows:
where S.sub.ij is the environmental similarity between site i and site j; a, b, . . . , n are the weights of the various environmental factors; and e.sub.vi and e.sub.vj are the characteristic values of the v-th environmental variable at site i and site j, respectively.
[0030] (3) Reliability of the sample site is measured.
[0031] The reliability of the sample site is calculated by using the environmental similarity between the sample sites and the similarity for the target variable. The relationship between sample sites (supporting or contradictory) is determined from these two similarities. The more supporting sample sites a sample site has, the higher its reliability; the more contradictory sample sites it has, the lower its reliability; if the sample site has only contradictory sample sites and no supporting sample sites, its reliability is 0; and if it has neither supporting nor contradictory sample sites, its reliability is unknown and is set to a null value “NoData”. The calculation formula of the reliability of the sample site is as follows:
where r.sub.i is the reliability of sample site i; n.sub.s and n.sub.c are the number of supporting sample sites and the number of contradictory sample sites for sample site i, respectively; and TS.sub.i,k is based on the similarity for the target variable between sample site i and the k-th supporting sample site.
[0032] (4) Spatial prediction is carried out on the rice stable isotope according to the environmental similarity between the site to be predicted and the sample site and the reliability of the sample site.
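The reliability rules of step (3) can be sketched in Python as follows. Two things in this sketch are assumptions rather than content of the disclosure. First, a neighbouring sample site is treated as supporting when both its environmental similarity reaches p1 and its target-variable similarity reaches p2, and as contradictory when only the environmental similarity reaches p1. Second, the reliability is computed as the sum of TS over supporting sites divided by (n_s + n_c). This stand-in reproduces the two boundary cases (0 and “NoData”) and falls as contradictory sites are added, but it is not claimed to match the formula of the disclosure.

```python
from typing import List, Optional, Sequence, Tuple

Matrix = Sequence[Sequence[float]]


def classify_neighbours(
    i: int, env_sim: Matrix, target_sim: Matrix, p1: float, p2: float
) -> Tuple[List[int], List[int]]:
    """Split the other sample sites into supporting and contradictory sets."""
    supporting: List[int] = []
    contradictory: List[int] = []
    for k in range(len(env_sim)):
        if k == i or env_sim[i][k] < p1:
            # Environmentally dissimilar sites say nothing about site i.
            continue
        if target_sim[i][k] >= p2:
            supporting.append(k)      # similar environment, similar value
        else:
            contradictory.append(k)   # similar environment, different value
    return supporting, contradictory


def reliability(
    i: int, env_sim: Matrix, target_sim: Matrix, p1: float, p2: float
) -> Optional[float]:
    """Assumed stand-in: sum of TS over supporting sites / (n_s + n_c)."""
    supporting, contradictory = classify_neighbours(i, env_sim, target_sim, p1, p2)
    n_s, n_c = len(supporting), len(contradictory)
    if n_s == 0 and n_c == 0:
        return None   # unknown reliability, i.e. "NoData"
    if n_s == 0:
        return 0.0    # only contradictory sites
    return sum(target_sim[i][k] for k in supporting) / (n_s + n_c)


# Made-up similarity matrices for four sample sites.
env_sim = [
    [1.0, 0.9, 0.95, 0.2],
    [0.9, 1.0, 0.3, 0.1],
    [0.95, 0.3, 1.0, 0.4],
    [0.2, 0.1, 0.4, 1.0],
]
target_sim = [
    [1.0, 0.9, 0.5, 0.7],
    [0.9, 1.0, 0.6, 0.2],
    [0.5, 0.6, 1.0, 0.3],
    [0.7, 0.2, 0.3, 1.0],
]
reliabilities = [reliability(i, env_sim, target_sim, p1=0.8, p2=0.8) for i in range(4)]
print(reliabilities)
```

In this example site 0 has one supporting and one contradictory neighbour, site 1 has one supporting neighbour, site 2 has only a contradictory neighbour, and site 3 has no environmentally similar neighbour at all.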
[0033] The value of each site to be predicted is calculated by using the weighted average method, and the formula is as follows:
where n′ is the number of sample sites satisfying the prediction condition; S.sub.ji is the environmental similarity between the site to be predicted j and the sample site i; and V.sub.i is the target variable value (the stable isotope value) of sample site i.
[0034] The calculation formula of the uncertainty of the prediction is as follows:
U.sub.j=1−max(S.sub.j1×r.sub.1,S.sub.j2×r.sub.2, . . . ,S.sub.jn×r.sub.n),
where n is the number of sample sites; S.sub.jn is the environmental similarity between the site to be predicted j and the n-th sample site; r.sub.n is the reliability of that sample site; and the higher the similarity and the reliability are, the lower the uncertainty of the prediction is.
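Step (4) and the uncertainty measure can be sketched together in Python as follows. The uncertainty follows the expression U.sub.j=1−max(S.sub.ji×r.sub.i) given above. The prediction assumes that each sample value is weighted by the product S.sub.ji×r.sub.i and that the weights are normalized by their sum. That weighting, the hypothetical `min_similarity` screening parameter, and the treatment of “NoData” reliabilities as zero weight are assumptions of this sketch, not statements of the disclosure.

```python
from typing import Optional, Sequence, Tuple


def predict_site(
    similarities: Sequence[float],
    reliabilities: Sequence[Optional[float]],
    values: Sequence[float],
    min_similarity: float = 0.0,
) -> Tuple[Optional[float], float]:
    """Return (predicted value, uncertainty) for one site to be predicted."""
    weights = []
    for s, r in zip(similarities, reliabilities):
        if r is None or s < min_similarity:
            # Assumption: NoData or screened-out sample sites do not contribute.
            weights.append(0.0)
        else:
            weights.append(s * r)
    # Uncertainty from the source expression: 1 - max(S_ji * r_i).
    uncertainty = 1.0 - max(weights, default=0.0)
    total = sum(weights)
    if total == 0.0:
        return None, uncertainty  # no usable sample site for this location
    prediction = sum(w * v for w, v in zip(weights, values)) / total
    return prediction, uncertainty


# Made-up example: three sample sites with delta-13C values in per mil.
predicted_value, predicted_uncertainty = predict_site(
    similarities=[0.9, 0.5, 0.8],
    reliabilities=[0.5, 1.0, None],
    values=[-27.0, -26.0, -25.0],
)
print(predicted_value, predicted_uncertainty)
```

Because the same products S.sub.ji×r.sub.i drive both outputs, a location whose best sample site is either dissimilar or unreliable receives a high uncertainty even when a prediction can still be computed.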
[0035] The effectiveness of the method used by the present disclosure is analyzed.
[0036] A cross-validation method is used: 70% of the sample sites are randomly selected as a training sample site set, and the remaining 30% are used as a verification sample site set. This random split is repeated ten times, and the prediction results are evaluated and analyzed. The spatial prediction of stable isotope is then evaluated by comparison with the existing regression-Kriging method.
[0037] With reference to the flow chart shown in
[0038] The study area is the main rice producing area in China. There are 794 sampling sites in the study area. The sampling was performed in 2017 and covered 117 counties (cities or districts) in 17 provinces. The target variables of the prediction are the stable carbon, hydrogen and oxygen isotopes (δ.sup.13C, δ.sup.2H and δ.sup.18O), which are measured with an isotope mass spectrometer (Isoprime 100, isoprime UK Ltd.). The spatial resolution of the prediction is 0.15°×0.15°.
[0039] According to the existing research results on stable isotope influencing factors, the present disclosure selects 10 influencing factors for 2017: average annual temperature, average annual relative humidity, annual precipitation, annual sunshine hours, annual accumulated temperature (>10° C.), average temperature in the growing season (June to October), average relative humidity in the growing season, precipitation in the growing season, sunshine hours in the growing season and accumulated temperature in the growing season (>10° C.). These factors form an influencing factor database, and the correlation between each factor and each stable isotope is then analyzed. The significantly correlated factors (p<0.01) are selected as the auxiliary variables in the prediction. For the prediction of δ.sup.13C and δ.sup.18O, all 10 influencing factors are significantly correlated, so all of them are used as auxiliary variables. For δ.sup.2H, all factors except the average temperature in the growing season and the accumulated temperature in the growing season are used as auxiliary variables for the spatial prediction.
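The factor screening described above can be sketched in Python as follows. The disclosure states only that significantly correlated factors (p<0.01) are kept; it does not name the correlation coefficient or the test. This sketch assumes the Pearson coefficient and a two-sided p-value from the Fisher z approximation, which is reasonable for large samples. All function names, factor names and data are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant series carries no correlation information
    return max(-1.0, min(1.0, sxy / math.sqrt(sxx * syy)))


def approx_p_value(r: float, n: int) -> float:
    """Two-sided p-value via the Fisher z approximation (an assumption)."""
    if n <= 3:
        return 1.0
    if abs(r) >= 1.0:
        return 0.0
    z = math.atanh(r) * math.sqrt(n - 3)
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)) for a standard normal.
    return math.erfc(abs(z) / math.sqrt(2.0))


def screen_factors(
    factors: Dict[str, Sequence[float]], isotope: Sequence[float], alpha: float = 0.01
) -> List[str]:
    """Keep the factors whose correlation with the isotope has p < alpha."""
    n = len(isotope)
    return [
        name
        for name, series in factors.items()
        if approx_p_value(pearson_r(series, isotope), n) < alpha
    ]


# Made-up data for 30 sites: one factor tracks the isotope, one does not.
isotope = [-28.0 + 0.05 * k + 0.1 * ((k * 7) % 5 - 2) for k in range(30)]
factors = {
    "growing_season_temperature": [10.0 + 0.5 * k for k in range(30)],
    "unrelated_factor": [1.0 if k % 2 == 0 else -1.0 for k in range(30)],
}
selected = screen_factors(factors, isotope)
print(selected)
```

The screening would be run once per isotope, since the disclosure reports a different set of retained factors for δ.sup.2H than for δ.sup.13C and δ.sup.18O.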
[0040] From the set of 794 sample sites, 555 (70%) sample sites are randomly selected as a training sample site set, and the remaining 239 (30%) sample sites are used as a verification sample site set. The prediction model is established by using the training sample site set, and the verification sample site set is used to test and evaluate the prediction result of the model. This procedure is repeated ten times.
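The repeated 70%/30% split can be sketched in Python as follows. The disclosure does not state which error measure underlies the reported accuracies, so the root-mean-square error used here is an assumption. The `fit_predict` callback, the mean-value baseline and the data are hypothetical placeholders for the actual prediction method.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple


def split_indices(
    n_sites: int, train_fraction: float, rng: random.Random
) -> Tuple[List[int], List[int]]:
    """Randomly split site indices into training and verification sets."""
    indices = list(range(n_sites))
    rng.shuffle(indices)
    n_train = int(n_sites * train_fraction)  # 794 sites -> 555 training sites
    return indices[:n_train], indices[n_train:]


def rmse(predicted: Sequence[float], observed: Sequence[float]) -> float:
    """Root-mean-square error (assumed accuracy measure)."""
    return math.sqrt(
        sum((p - o) ** 2 for p, o in zip(predicted, observed)) / len(observed)
    )


def repeated_holdout(
    values: Sequence[float],
    fit_predict: Callable[[List[int], List[int]], List[float]],
    repeats: int = 10,
    train_fraction: float = 0.7,
    seed: int = 0,
) -> float:
    """Average error over `repeats` random training/verification splits."""
    rng = random.Random(seed)
    scores = []
    for _ in range(repeats):
        train, test = split_indices(len(values), train_fraction, rng)
        predicted = fit_predict(train, test)
        scores.append(rmse(predicted, [values[i] for i in test]))
    return sum(scores) / len(scores)


# Made-up isotope values for 794 sites and a trivial baseline predictor.
values = [-27.0 + 0.001 * i for i in range(794)]


def mean_baseline(train: List[int], test: List[int]) -> List[float]:
    """Predict the training mean everywhere (placeholder for the real model)."""
    mean_value = sum(values[i] for i in train) / len(train)
    return [mean_value for _ in test]


average_error = repeated_holdout(values, mean_baseline)
print(round(average_error, 3))
```

Passing the predictor as a callback lets the same loop evaluate both the similarity-based method and a comparison method on identical splits, provided the same seed is used.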
[0041] The environmental similarity is calculated according to step (2), and the reliability of the sample sites is calculated according to step (3). Finally, based on step (4), the sample sites are screened according to their environmental similarity and reliability, and the predicted value and the uncertainty of the prediction are calculated for each site to be predicted. The average prediction accuracies of δ.sup.13C, δ.sup.2H and δ.sup.18O are 0.51‰, 7.09‰ and 2.06‰, respectively. The corresponding average accuracies obtained with the regression-geostatistical method are 0.54‰, 8.83‰ and 2.11‰, respectively. For all three isotopes, the method of the present disclosure is therefore more accurate in prediction than the regression-geostatistical method.
[0042] The present disclosure yields spatial distribution diagrams of δ.sup.13C, δ.sup.2H and δ.sup.18O of rice in China, together with a spatial distribution diagram of the prediction uncertainty. Subsequent sampling can be planned according to the uncertainty of the prediction: more sampling sites can be set in areas with higher uncertainty, and fewer sampling sites in areas with lower uncertainty, so that the sampling sites are planned reasonably and cost is saved.
[0043] It can be understood that although the present disclosure has been disclosed in terms of preferred embodiments, the above embodiments are not intended to limit the present disclosure. For those skilled in the art, many possible changes and modifications can be made to the technical solution of the present disclosure by using the technical contents disclosed above, or the technical solution can be modified into equivalent embodiments with equivalent changes without departing from the scope of the technical solution of the present disclosure. Therefore, any simple modification, equivalent change and modification made to the above embodiment according to the technical essence of the present disclosure without departing from the content of the technical solution of the present disclosure still belongs to the scope of protection of the technical solution of the present disclosure.