SMART REAL ESTATE EVALUATION SYSTEM
20230027774 ยท 2023-01-26
Inventors
- Tien-Hao Chang (Chiayi City, TW)
- Chin-Mei Chiang (New Taipei City, TW)
- Sheng-Wen Huang (Taipei City, TW)
- Shau-Wei Huang (New Taipei City, TW)
- Po-Yu Shen (Tainan City, TW)
- Li-Hao Zeng (Yilan County, TW)
Cpc classification
G06N5/01
PHYSICS
International classification
Abstract
To automatically evaluate the reasonable price of real estate according to the housing data, the present invention discloses a novel intelligent property evaluation system. The system includes the following components: a housing data input system, a pre-processing filter, a feature extractor, a housing price trainer, and a housing price predictor, wherein the housing price predictor further includes a regression model generator and a decision integrator. The pre-processing filter is used to filter unreasonable samples from housing data and integrate synonymous features. The feature extractor is used to choose required variables of housing price model. The housing price trainer generates housing price model which is trained by a great amount of housing data. The housing price predictor then generates a prediction by the trained model. Furthermore, to maintain the accuracy of prediction under the social evolution, the housing price predictor could be regularly or irregularly updated by a rolling-based method.
Claims
1. A smart real estate evaluation system, comprising: a housing data input system to regularly or irregularly input a plurality of housing data of multiple objects; a feature extractor coupled with said housing data input system to extract plurality of housing data, wherein said feature extractor comprises a variable manager to manage dimensions of variables in said system and generates feature vectors from said plurality of housing data; a housing price trainer coupled with said feature extractor to train a housing price model through said feature vectors; and a housing price predictor to predict a housing price through said housing price model.
2. The system of claim 1, further comprising a pre-processing filter coupled with said housing data input system to filter unreasonable data and integrate synonymous features.
3. The system of claim 2, wherein said pre-processing filter includes a categorical data merger to merge fields with similar properties in said plurality of housing data.
4. The system of claim 1, wherein said housing price predictor includes a regression model generator to regresses variables in said feature vectors through regression trees.
5. The system of claim 4, wherein an algorithm of said regression model generator includes a gradient boosting decision tree (GBDT), Catboost, XGBoost (eXtreme Gradient Boosting), LightGBM or the combination thereof.
6. The system of claim 4, wherein said the housing price predictor includes a decision integrator to predict said housing price according to a result of regression operation of said regression model generator.
7. The system of claim 1, wherein said feature vectors are a high-dimensional matrix containing multiple variables, and each object corresponds to its corresponding feature vector.
8. The system of claim 1, wherein said regression model generator generates multiple regression trees according to variables in said feature vectors, each regression tree is equivalent to a weak learner.
9. The system of claim 8, wherein said decision integrator integrates results of said multiple regression trees so that said housing price model is created by multiple weak learners constituting a strong learner.
10. The system of claim 1, wherein said variable manager selects corresponding fields in plurality of housing data.
11. An executing method for smart real estate evaluation system, comprising: inputting a plurality of housing data of multiple objects by a housing data input system; transmitting said plurality of housing data to a pre-processing filter for housing data pre-processing; extracting features suitable for evaluating a housing price based on said plurality housing data by a feature extractor; generate a housing price model by a housing price trainer and a housing price predictor; and predicting a housing price of a target object based on said housing price model by a housing price predictor.
12. The method of claim 11, wherein said plurality of housing data are from a service network of actual price registration of real estate transaction, or other resources that provide said plurality housing data.
13. The method of claim 11, further comprising a variable dimension processing, said features are selected by a forward selection method or a backward selection method by a variable manager to generate feature vectors.
14. The method of claim 13, wherein said housing price predictor includes a regression model generator to regresses variables in said feature vectors through regression trees.
15. The method of claim 14, wherein an algorithm of said regression model generator includes a gradient boosting decision tree (GBDT), Catboost, XGBoost (eXtreme Gradient Boosting), LightGBM or the combination thereof.
16. The method of claim 14, wherein said the housing price predictor includes a decision integrator to predict said housing price according to a result of regression operation of said regression model generator.
17. The method of claim 13, wherein said feature vectors are a high-dimensional matrix containing multiple variables, and each object corresponds to its corresponding feature vector.
18. The method of claim 14, wherein said regression model generator generates multiple regression trees according to variables in said feature vectors, each regression tree is equivalent to a weak learner.
19. The method of claim 18, wherein said decision integrator integrates results of said multiple regression trees so that said housing price model is created by multiple weak learners constituting a strong learner.
20. The method of claim 11, wherein said variable manager selects corresponding fields in plurality of housing data.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0015] The components, characteristics and advantages of the present invention may be understood by the detailed descriptions of the preferred embodiments outlined in the specification and the drawings attached:
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
DETAILED DESCRIPTION
[0023] Some preferred embodiments of the present invention will now be described in greater detail. However, it should be recognized that the preferred embodiments of the present invention are provided for illustration rather than limiting the present invention. In addition, the present invention can be practiced in a wide range of other embodiments besides those explicitly described, and the scope of the present invention is not expressly limited except as specified in the accompanying claims.
[0024] The purpose of the invention is to improve performing processes of previous technology for predicting the selling price of objects in the housing price model. By improving the processes in the stages of housing data pre-processing, housing feature extraction and building housing price model and proposing the best applicable algorithm, it can improve generalizability and apply to various kinds of objects, such as apartments, buildings, townhouse, etc. The key points of improvements include as follows: first, in the stage of housing data pre-processing, the objects that missing data or exceed the reasonable range is screened; secondly, when extracting housing features, the appropriate data dimensions is analyzed and screened out according to the previous housing data; third, when building the housing price model, the housing price model can be continuously trained at a specific time interval, so that it can timely update and reflect social and economic changes, increase the accuracy of housing price prediction, and reduce the subjectivity of human analysis of housing prices.
[0025] In order to achieve the above purpose, please refer to
[0026] In an embodiment of the invention, when selecting variables, the variable manager 105c selects the corresponding fields (features or variables) in the housing data according to the needs of the application, such as room type, floor, housing size, etc. of the object, and the selection method is forward selection or/and backward selection. The forward selection method refers to select the significant (distinctive) housing features one by one into the model until all significant housing features are selected into the model. The backward selection method refers to eliminate the insignificant housing features one by one until all housing features stored in the model are significant. In addition, when the transaction is a multistorey building for sale, because a transaction involves (contains) multiple floors, the feature vectors need additional variable dimensions to describe this situation when training the housing price model. When the housing data includes parking spaces, it will also need to add an additional dimension of variables to indicate whether the housing data includes parking spaces, so as to improve the accuracy of the housing price model.
[0027] Referring to
[0028] Referring to
[0029] According to one embodiment of the invention, the housing price predictor 107 includes a regression model generator 111, which regresses housing price on feature vector through the regression tree. The regression algorithm of the regression model generator 111 can be gradient boosting decision tree (GBDT), Catboost, XGBoost (eXtreme Gradient Boosting), LightGBM, etc. or a combination of the above algorithms. The above feature vectors can be a high-dimensional matrix containing multiple variables (features or columns), and each object will correspond to its corresponding feature vector (rows). For example, when the transaction of an object involves the purchase and sale of multiple floors, such as the purchase of an apartment on two floors. Because each household has corresponding field values of room type, floor and area, it is difficult to express the variable in one dimension when trading multiple floors. Therefore, the multi-hot encoding technique is used to express this situation, referring to
[0030] According to one embodiment of the invention, when the regression model generator 111 generates multiple regression trees according to the variables in the feature vector, each regression tree is equivalent to a weak learner. For example, take objects 1-4 in
[0031] Referring to
[0032] Then, after the housing data pre-processing 603 stage is completed, it enters a housing feature extraction 605 stage. In this process, the feature extractor 105 extracts the features suitable for evaluating the housing price according to the housing data or the application scenarios of the transaction, such as housing age, housing type, adjacent facilities, housing size, etc., and ignore the less important factors according to the application needs, such as celebrity endorsement. In the stage of variable dimension processing 605c, the above features can be further selected by, for example, forward selection method or/and backward selection method. The variable manager 105c determines the number of features need to be used to generate feature vectors based-on the application scenarios, such as the computing resources of CPU and GPU, or the amount of information loss in the loss function.
[0033] Following by the above, when the feature vector is generated, it enters the house price modeling stage 607. In this process, the housing price trainer 109 and the housing price predictor 107 can generate a housing price model by an algorithm based on regression tree such as GDBT, Catboost, XGBoost (eXtreme Gradient Boosting), LightGBM or a combination of the above algorithms. Then, testing of housing price model 607c is performed on the test data, which is independent of the training data. If the generated housing price model can reach a certain accuracy in the test data, for example, the mean absolute percentage error (MAPE) is within 3%-10%, or the percentage of the absolute percentage error less than 10%, namely hit rate, is 60%-90%, the stage of building housing price model 607 is completed.
[0034] Subsequently, when the housing price model passes the test, the housing price predictor 107 predicts the housing price of the target object according to the housing price model. Finally, in the stage of rolling updating the housing price model 611, the housing data input system 101 inputs the latest housing data in the uncertain period or a fixed time interval T, and repeats the above steps to generate a new housing price model to meet the latest market trend. Meanwhile, it can also adjust the processing method in the feature extractor 105 according to the accuracy of the housing price model, such as the number of features, the feature vector, or the parameters of the algorithm in the housing price trainer 109.
[0035] As will be understood by persons skilled in the art, the foregoing preferred embodiment of the present invention illustrates the present invention rather than limiting the present invention. Having described the invention in connection with a preferred embodiment, modifications will be suggested to those skilled in the art. Thus, the invention is not to be limited to this embodiment, but rather the invention is intended to cover various modifications and similar arrangements included within the spirit and scope of the appended claims, the scope of which should be accorded the broadest interpretation, thereby encompassing all such modifications and similar structures. While the preferred embodiment of the invention has been illustrated and described, it will be appreciated that various changes can be made without departing from the spirit and scope of the invention.