CROP YIELD PREDICTION METHOD AND SYSTEM
20230024846 · 2023-01-26
Inventors
Cpc classification
G01N21/31
PHYSICS
International classification
G01N21/31
PHYSICS
Abstract
A crop yield prediction method and system. The method includes: obtaining a test normalized difference vegetation index and test meteorological data of a to-be-tested area; and inputting the test normalized difference vegetation index and the test meteorological data into a hierarchical linear regression model, to obtain a predicted yield of the to-be-tested area; where a method for determining the hierarchical linear regression model is: obtaining a training normalized difference vegetation index of a crop planting area; obtaining training meteorological data and measured yield data of the crop planting area; constructing a first regression equation and a second regression equation, where dependent variables of the second regression equation are a slope and an intercept of the first regression equation; and inputting the training normalized difference vegetation index and the measured yield data into the first regression equation, and inputting the training meteorological data into the second regression equation.
Claims
1. A crop yield prediction method, comprising: obtaining a test normalized difference vegetation index and test meteorological data of a to-be-tested area; and inputting the test normalized difference vegetation index and the test meteorological data into a hierarchical linear regression model, to obtain a predicted yield of the to-be-tested area; wherein a method for determining the hierarchical linear regression model is: obtaining a training normalized difference vegetation index of a crop planting area; obtaining training meteorological data and measured yield data of the crop planting area; constructing a first regression equation and a second regression equation, wherein dependent variables of the second regression equation are a slope and an intercept of the first regression equation; and inputting the training normalized difference vegetation index and the measured yield data into the first regression equation, and the training meteorological data into the second regression equation to train the first regression equation and the second regression equation, and determining the trained first regression equation as the hierarchical linear regression model.
2. The crop yield prediction method according to claim 1, wherein the obtaining a training normalized difference vegetation index of a crop planting area comprises: obtaining remote sensing image data of the crop planting area; calculating a spectral reflectance based on the remote sensing image data; and performing band calculation on the spectral reflectance to obtain the training normalized difference vegetation index.
3. The crop yield prediction method according to claim 2, wherein the remote sensing image data is Landsat image data; and bands of the Landsat image data comprises blue band, green band, red band, and near-infrared band.
4. The crop yield prediction method according to claim 2, wherein a formula for performing band calculation on the spectral reflectance is:
NDVI=(ρ.sub.NIR−ρ.sub.R)/(ρ.sub.NIR+ρ.sub.R), wherein ρ.sub.NIR is a spectral reflectance of near-infrared band; ρ.sub.R is a spectral reflectance of red band, and NDVI is the training normalized difference vegetation index.
5. The crop yield prediction method according to claim 1, wherein a formula of the first regression equation is:
Y.sub.ij=β.sub.0j+β.sub.1j×NVDI.sub.i+e.sub.ij, wherein β.sub.0j is the intercept of the first regression equation, β.sub.1j is the slope of the first regression equation, e.sub.ij is a random error of the first regression equation, Y.sub.ij is the i-th predicted yield, NDVI.sub.i is the i-th normalized difference vegetation index in the training normalized difference vegetation indices, and j is a numerical subscript.
6. The crop yield prediction method according to claim 5, wherein a formula of the second regression equation is:
β.sub.0j=γ.sub.00+γ.sub.01×RAD+γ.sub.02×T.sub.max+γ.sub.03×T.sub.min+γ.sub.04×PRE+μ.sub.0j;
β.sub.1j=γ.sub.10+γ.sub.11×RAD+γ.sub.12×T.sub.max+γ.sub.13×T.sub.min+γ.sub.14×PRE+μ.sub.1j, wherein γ.sub.00 is a first intercept of the second regression equation, γ.sub.10 is a second intercept of the second regression equation, RAD is average sunshine duration in the training meteorological data, γ.sub.01 is a first slope of the average sunshine duration, γ.sub.11 is a second slope of the average sunshine duration, T.sub.max is average daily maximum temperature in the training meteorological data, γ.sub.02 is a first slope of the average daily maximum temperature, γ.sub.12 is a second slope of the average daily maximum temperature, T.sub.min is average daily minimum temperature in the training meteorological data, γ.sub.03 is a first slope of the average daily minimum temperature, γ.sub.13 is a second slope of the average daily minimum temperature, PRE is average daily precipitation in the training meteorological data, γ.sub.04 is a first slope of the average daily precipitation, γ.sub.14 is a second slope of the average daily recipitation, μ.sub.0j is a first random error of the second regression equation, and μ.sub.1j is a second random error of the second regression equation.
7. The crop yield prediction method according to claim 1, wherein the crop in the to-be-tested area is corn.
8. The crop yield prediction method according to claim 7, wherein the corn is in the grain filling stage.
9. The crop yield prediction method according to claim 1, wherein the obtaining training meteorological data of a crop planting area comprises: obtaining a daily value data set of surface climate data, wherein the daily value data set of surface climate data comprises daily maximum temperature, daily minimum temperature, daily precipitation, and sunshine duration of the to-be-tested area; and calculating the training meteorological data based on the daily value data set of surface climate data, wherein the training meteorological data comprises average daily maximum temperature, average daily minimum temperature, average daily precipitation, and average sunshine duration.
10. A crop yield prediction system, comprising: a test data obtaining module, configured to obtain a test normalized difference vegetation index and test meteorological data of a to-be-tested area; and a prediction module, configured to input the test normalized difference vegetation index and the test meteorological data into a hierarchical linear regression model, to obtain a predicted yield of the to-be-tested area; wherein the prediction module comprises: a first obtaining module, configured to obtain a training normalized difference vegetation index of a crop planting area; a second obtaining module, configured to obtain training meteorological data and measured yield data of the crop planting area; a construction module, configured to construct a first regression equation and a second regression equation; and a training module, configured to train the first regression equation based on the training normalized difference vegetation index and the measured yield data, and train the second regression equation based on the training meteorological data, so as to obtain the hierarchical linear regression model, wherein dependent variables of the second regression equation are a slope and an intercept of the first regression equation.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0042] To describe the embodiments of the present disclosure or the technical solutions in the related art more clearly, the accompanying drawings required in the embodiments are briefly introduced below. Obviously, the accompanying drawings described below are only some embodiments of the present disclosure. A person of ordinary skill in the art may further obtain other accompanying drawings based on these accompanying drawings without creative efforts.
[0043]
[0044]
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0045] The technical solutions in the embodiments of the present disclosure will be described below clearly and completely with reference to the accompanying drawings in the embodiments of the present disclosure. Apparently, the described embodiments are merely some rather than all of the embodiments of the present disclosure. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.
[0046] To make the above-mentioned objective, features, and advantages of the present disclosure clearer and more comprehensible, the present disclosure will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0047]
[0048] Step 100: obtain a test normalized difference vegetation index and test meteorological data of a to-be-tested area; and
[0049] Step 200: input the test normalized difference vegetation index and the test meteorological data into a hierarchical linear regression model, to obtain a predicted yield of the to-be-tested area.
[0050] A method for determining the hierarchical linear regression model is:
[0051] Step 201: obtain a training normalized difference vegetation index of a crop planting area;
[0052] Step 202: obtain training meteorological data and measured yield data of the crop planting area;
[0053] Step 203: construct a first regression equation and a second regression equation, where dependent variables of the second regression equation are a slope and an intercept of the first regression equation; and
[0054] Step 204: input the training normalized difference vegetation index and the measured yield data into the first regression equation, and the training meteorological data into the second regression equation to train the first regression equation and the second regression equation, and determine the trained first regression equation as the hierarchical linear regression model.
[0055] Preferably, the crop in the to-be-tested area is corn. The corn is in the grain filling stage.
[0056] Optionally, this embodiment aims to provide a method for predicting corn yields in a large area under multiple weather conditions, to accurately predict corn yields at unknown areas.
[0057] Preferably, the obtaining a training normalized difference vegetation index of a crop planting area includes:
[0058] obtaining remote sensing image data of the crop planting area;
[0059] calculating a spectral reflectance based on the remote sensing image data; and
[0060] performing band calculation on the spectral reflectance to obtain the training normalized difference vegetation index.
[0061] Preferably, the remote sensing image data is Landsat image data; and bands of the Landsat image data include blue band, green band, red band, and near-infrared band.
[0062] Specifically, the remote sensing data is Landsat image data, which covers nine bands, including blue, green, red, and near-infrared bands, a wavelength ranges from 0.43 to 1.38, and multispectral band spatial resolution is 30 meters.
[0063] Preferably, the obtaining training meteorological data of the crop planting area includes:
[0064] obtaining a daily value data set of surface climate data, where the daily value data set of surface climate data includes daily maximum temperature, daily minimum temperature, daily precipitation, and sunshine duration of the to-be-tested area; and
[0065] calculating the training meteorological data based on the daily value data set of surface climate data, where the training meteorological data includes average daily maximum temperature, average daily minimum temperature, average daily precipitation, and average sunshine duration.
[0066] Optionally, the meteorological data comes from site meteorological information published at the China Meteorological Data website, and the data set is the daily value data set of China surface climate data (V3.0), including daily maximum temperature, daily minimum temperature, daily precipitation, and sunshine duration.
[0067] In an optional embodiment, the measured yield data comes from field yield data.
[0068] Preferably, a formula for performing band calculation on the spectral reflectance is: NDVI=(ρ.sub.NIR−ρ.sub.R)/(ρ.sub.NIR+ρ.sub.R), where
[0069] ρ.sub.NIR is a spectral reflectance of near-infrared band; ρ.sub.R is a spectral reflectance of red band, and NDVI is the training normalized difference vegetation index.
[0070] When a light source illuminates the surface of an object, the object selectively reflects electromagnetic waves of different wavelengths. The spectral reflectance is a ratio of the luminous flux reflected by the object at a band to the luminous flux received by the object, and is an essential property of the object surface. The spectral reflectance is the representation of color by the object itself, which not only records the color information of the object, but also represents the surface material of the object.
[0071] In this embodiment, the first regression equation and the second regression equation are combined to form a hierarchical linear regression model. When data is present in different levels, variables at the first layer are used to construct a regression equation, and then an intercept and a slope in the equation are used as dependent variables, and variables at the second layer are used as independent variables, to construct two new equations. In this way, influence of variables at different layers on dependent variables can be explored. Because the intercept and slope in the first-layer regression equation are used as random variables in the second-layer regression equation, this is also called “regression of regression”.
[0072] Preferably, a formula of the first regression equation is: Y.sub.ij=β.sub.0j+β.sub.1j×NVDI.sub.i+e.sub.ij, where
[0073] β.sub.0j is the intercept of the first regression equation, β.sub.1j is the slope of the first regression equation, e.sub.ij is a random error of the first regression equation, Y.sub.ij is the i-th predicted yield, NDVI.sub.i is the i-th training normalized difference vegetation index, and j is a numerical subscript.
[0074] Specifically, the first regression equation constitutes the first layer of the hierarchical linear regression model, and the first layer is similar to an ordinary least squares (OLS) regression model, and includes an independent variable remote sensing parameter (NDVI) and a dependent variable Y (yield).
[0075] Preferably, a formula of the second regression equation is as follows:
β.sub.0j=γ.sub.00+γ.sub.01×RAD+γ.sub.02×T.sub.max+γ.sub.03×T.sub.min+γ.sub.04×PRE+μ.sub.0j;
β.sub.1j=γ.sub.10+γ.sub.11×RAD+γ.sub.12×T.sub.max+γ.sub.13×T.sub.min+γ.sub.14×PRE+μ.sub.1j, where
[0076] γ.sub.00 is a first intercept of the second regression equation, γ.sub.10 is a second intercept of the second regression equation, RAD is average sunshine duration in the training meteorological data, γ.sub.01 is a first slope of the average sunshine duration, γ.sub.11 is a second slope of the average sunshine duration, T.sub.max is average daily maximum temperature in the training meteorological data, you is a first slope of the average daily maximum temperature, γ.sub.12 is a second slope of the average daily maximum temperature, T.sub.min is average daily minimum temperature in the training meteorological data, γ.sub.03 is a first slope of the average daily minimum temperature, γ.sub.13 is a second slope of the average daily minimum temperature, PRE is average daily precipitation in the training meteorological data, γ.sub.04 is a first slope of the average daily precipitation, γ.sub.14 is a second slope of the average daily precipitation, μ.sub.0j is a first random error of the second regression equation, and μ.sub.1j is a second random error of the second regression equation.
[0077] Specifically, the second-layer equation is as follows:
[0078] β.sub.mj=γ.sub.m0+γ.sub.m1×RAD+γ.sub.m2×T.sub.max+γ.sub.m3×T.sub.min+γ.sub.m4×PRE+μ.sub.mj. When m is equal to 1, the dependent variable is the slope in the first-layer model; when m is equal to 0, the dependent variable is equal to the intercept in the first-level model. Independent variables in the second-layer model are meteorological parameters (RAD, PRE, T.sub.max, T.sub.min). Quantities to be solved are γ.sub.m1 to γ.sub.m4.
[0079] This embodiment also provides a crop yield prediction system.
[0080] a test data obtaining module, configured to obtain a test normalized difference vegetation index and test meteorological data of a to-be-tested area; and
[0081] a prediction module, configured to input the test normalized difference vegetation index and the test meteorological data into a hierarchical linear regression model, to obtain a predicted yield of the to-be-tested area. The hierarchical linear regression model is determined by a regression model construction module.
[0082] The regression model construction module includes:
[0083] a first obtaining module, configured to obtain a training normalized difference vegetation index of a crop planting area;
[0084] a second obtaining module, configured to obtain training meteorological data and measured yield data of the crop planting area;
[0085] a construction module, configured to construct a first regression equation and a second regression equation, where dependent variables of the second regression equation are a slope and an intercept of the first regression equation; and
[0086] a training module, configured to input the training normalized difference vegetation index and the measured yield data into the first regression equation, and the training meteorological data into the second regression equation to train the first regression equation and the second regression equation, and determine the trained first regression equation as the hierarchical linear regression model.
[0087] Specifically, the first obtaining module includes:
[0088] a first obtaining unit, configured to obtain remote sensing image data of a corn planting area;
[0089] a first calculation unit, configured to calculate a spectral reflectance based on the remote sensing image data; and
[0090] a second calculation unit, configured to perform band calculation on the spectral reflectance to obtain the training normalized difference vegetation index.
[0091] The present disclosure has the following beneficial effects:
[0092] Based on hierarchical linear regression modeling, the present disclosure combines spectral information of the corn grain filling stage obtained by sensors with local meteorological data, and constructs a model relationship between corn yields and spectral and meteorological data through hierarchical linear regression equations, so as to predict corn yields in unknown areas based on the spectral information and meteorological data. The present disclosure can enhance model adaptability with relatively less data, and does not require massive information collection in the early stage, making crop yield prediction much easier. Moreover, compared with a pure linear or nonlinear model, the present disclosure can achieve more accurate prediction results.
[0093] Each embodiment of the present specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and mutual reference may be made between the same and similar parts. Since the system disclosed in an embodiment corresponds to the method disclosed in another embodiment, the description is relatively simple, and reference can be made to the method description.
[0094] Specific examples are used herein to explain the principles and embodiments of the present disclosure. The foregoing description of the embodiments is merely intended to help understand the method of the present disclosure and its core ideas; besides, various modifications may be made by a person of ordinary skill in the art to specific embodiments and the scope of application in accordance with the ideas of the present disclosure. In conclusion, the content of the present specification shall not be construed as limitations to the present disclosure.