METHOD FOR PREDICTING A FEEDSTUFF AND/OR FEEDSTUFF RAW MATERIAL
20220244173 · 2022-08-04
Assignee
Inventors
- Ingolf REIMANN (Reinheim, DE)
- Joachim REISING (Kleinostheim, DE)
- Christoph MÜLLER (Offenbach am Main, DE)
Cpc classification
International classification
Abstract
A computer-implemented method for predicting a feedstuff and/or feedstuff raw material is described. The method comprises providing a near infrared (NIR) spectrum of a sample of an unknown feedstuff raw material and/or feedstuff. The absorption intensities of wavelengths or wavenumbers in the spectrum are transformed to give a query vector. A set of database vectors of a population of spectra of known feedstuff raw materials and/or feedstuffs is also provided, and these comprise at least 50 spectra of samples of each feedstuff and/or feedstuff raw material from each of its global growing areas. The similarity between the query vector and each of database vectors is analyzed to produce a score, and the feedstuff raw material and/or feedstuff of the database vector with the highest score is assigned to the sample.
Claims
1-14. (canceled)
15. A computer-implemented method for predicting a feedstuff and/or feedstuff raw material, the method comprising: a) providing a near infrared (NIR) spectrum of a sample of an unknown feedstuff raw material and/or feedstuff; b) transforming absorption intensities of wavelengths or wavenumbers in the spectrum of step a) to give a query vector; c) providing a set of database vectors of a population of spectra of known feedstuff raw materials and/or feedstuffs, wherein the population of spectra of known feedstuffs and/or feedstuff raw materials comprises at least 50 spectra of samples of each feedstuff and/or feedstuff raw material from each of its global growing areas; d) analyzing the similarity between the query vector of step b) and the set of database vectors of step c), the analyzing comprising: d1) calculating a similarity measure and/or a distance measure between each database vector of step c) and the query vector of step b) to give a similarity value for each database vector with the query vector; d2) ranking the similarity values obtained in step d1) in descending order, when the similarity measure is calculated in step d1) or in ascending order, when the distance measure is calculated in step d1), wherein the top-ranked database vector has the greatest similarity with the query vector; d3) counting the number of occurrences of each of the feedstuff raw materials and/or feedstuffs among the top-ranked database vectors in the ranking of step d2), wherein the number of occurrences is indicated by the variable N per each feedstuff raw material and/or feedstuff; d4) weighting the first N similarity values of each of the feedstuff raw materials and/or feedstuffs according to their position in the ranking of step d2) to give weighted rank positions of each of the feedstuff raw materials and/or feedstuffs, wherein the weighting of similarity values is done by taking the reciprocal rank position of the feedstuff raw material and/or feedstuff in ranking of step d2); and d5) forming the sum of the weighted rank positions of step d4) for each of the feedstuff raw materials and/or feedstuffs to give scores of each of the feedstuff raw materials and/or feedstuffs; and e) assigning the feedstuff raw material and/or feedstuff of the database vector with the highest score to the sample of step a).
16. The method of claim 15, wherein the vector in step b) and c) is a multi-dimensional vector, with each dimension corresponding to an absorption intensity of a specific wavelength or wavenumber.
17. The method of claim 15, wherein the spectrum in step a) and/or in step c) is recorded in a range of from 1,100 to 2,500 nm.
18. The method of claim 15, wherein the absorption intensities of equidistant wavelengths and/or wavenumbers in a spectrum are transformed to give a vector of a spectrum of step a) and/or of step b).
19. The method of claim 15, wherein the distances of the absorption intensities being transformed to vectors in step b) are identical with the distances of the absorptions intensities transformed to vectors in step c).
20. The method of claim 15, wherein the distance between the wavelengths or wavenumbers in step b) and/or step c) is from 0.1 nm+/−10% to 10 nm+/−10%, or from 10.sup.8 cm.sup.−1+/−10% to 10.sup.6 cm.sup.−1+/−10%.
21. The method of claim 15, wherein the number of the top-ranked database vectors to be considered in step d3) ranges from 10 to 100.
22. The method of claim 15, wherein the number of the top-ranked database vectors to be considered in step d3) ranges from 10 to 50.
23. The method of claim 15, wherein the population of spectra of known feedstuffs and/or feedstuff raw materials in step c) comprises spectra of all feedstuffs and/or feedstuff raw materials in ground and/or unground form used in animal nutrition.
24. The method of claim 15, further comprising forming a derivative of the spectrum of the unknown feedstuff raw material and/or feedstuff of step a) and/or of the spectra of known feedstuff raw materials and/or feedstuffs of step c).
25. A system for predicting a feedstuff raw material and/or feedstuff, comprising a processing unit adapted to carry out the method of claim 15.
26. The system of claim 25, wherein the processing unit forms a network with at least one other processing unit, on which the database vectors are stored.
Description
DESCRIPTION OF THE FIGURE
[0056]
EXAMPLE
[0057] The following is an example to illustrate the computer-implemented method according to the present invention in comparison to a simple prediction method of the prior art, and a prediction method involving a majority voting.
[0058] In the first step, the NIR spectrum of a sample of the feedstuff raw material FRM3 was recorded. The relevant information of said spectrum, i.e. absorption intensities of wavelengths, were transformed to give a query vector of said spectrum. In the next step, the similarity values of all database vectors with the query vector were calculated using a distance measure. The thus obtained similarity values were ranked according to their values including the indication of their corresponding feedstuff raw material and/or feedstuff in descending order to give a ranking of the database vector.
[0059] In the simple prediction model according to the prior art (in the following also referred to as Prediction method 1: 1-NN), the top-n vectors in this ranking represented the nearest neighbors in the similarity analysis between the query vector of the sample of an unknown feedstuff raw material and/or unknown feedstuff and the database vectors of a population of spectra of known feedstuff raw materials and/or feedstuffs.
[0060] The method with majority voting also started from the ranking of the database as mentioned above but also involves a counting of the occurrences of the material cases, i.e. the feedstuff raw material and/or feedstuff corresponding to the database vectors.
[0061] Finally, the method according to the present invention was used, which involves a majority-voting and a weighting of the thus obtained results.
TABLE-US-00001 TABLE 1 Ranking of database vectors according to their ranking and stating their corresponding type feedstuff raw material and/or feedstuff, i.e. FRM1. FRM2 or FRM3. Feedstuff raw Rank Similarity value Vector name material/feedstuff 1 0.93 V1 FRM1 2 0.91 V3 FRM3 3 0.74 V5 FRM3 4 0.69 V4 FRM3 5 0.42 V7 FRM2 6 0.33 V2 FRM3 7 0.27 V12 FRM2 8 0.15 V6 FRM2 9 0.04 V8 FRM1 10 0.03 V9 FRM1 11 0.02 V10 FRM2 12 0.01 V11 FRM2
[0062] In the prediction method 1 (1-NN) the feedstuff raw material with the highest similarity value according to the rankling of Table 1, i.e. FRM1, is indicated as the feedstuff raw material with the highest similarity to the material of the query vector.
[0063] By comparison, in the prediction method 2 (majority voting), the feedstuff raw material with the greatest number of occurrences in the ranking of Table 1, i.e. FRM2 with 5 votings over FRM1 with 3 votings and FRM2 with 4 votings, is indicated as the feedstuff raw material with the highest similarity to the material of the query vector.
[0064] In the prediction method 3. i.e. the method according to the present invention, the feedstuff raw material FRM3 is indicated as the feedstuff raw material with the highest similarity to the material of the query vector.
FRM3.fwdarw.1/2+1/3+1/4+1/6=1.249
FRM2.fwdarw.1/5+1/7+1/8+1/11+1/12=0.640
FRM1.fwdarw.1/1+1/9+1/10=1.211
[0065] The results of the three methods are extremely different. More importantly, two of the tested methods also give false positive results. Only the method according to the present invention gave the correct result.