
Graph neural networks for house price prediction : do or don't?
- Author
- Margot Geerts, Seppe vanden Broucke (UGent) and Jochen De Weerdt
- Abstract
- The domain of house price prediction, also referred to as real estate appraisal, has recently seen a shift from traditional statistical methodologies toward machine learning and deep learning techniques. As housing data is characterized by heterogeneous tabular data, and is subject to spatial dependencies, there is an exigent need for predictive models capable of capturing these complexities. Specifically, graph neural networks (GNNs) have been posited to discern spatial relationships by structuring housing data as graphs. Nevertheless, recent approaches frequently neglect alternative methods for graph construction, or lack a systematic comparative framework for different GNN approaches. Moreover, tree-based models, which are considered the state-of-the-art for tabular data, along with other contemporary methods, are often overlooked when evaluating GNNs. Therefore, this paper performs a comprehensive benchmark of graph construction methods and prevalent GNN models. Furthermore, we compare GNN approaches for house price prediction against an extensive suite of statistical, machine learning, and deep learning models. The results, drawn from six diverse housing datasets, reveal that GNNs are unsuccessful in surpassing machine learning and deep learning baselines. In particular, optimizing the graph structure yields only marginal improvements, with k-nearest neighbor graphs generally exhibiting superior performance. Among the GNN architectures evaluated, GraphSAGE and Transformer-based models demonstrate superior accuracy compared to other GNN variants. Ultimately, the findings suggest a general recommendation against the adoption of GNNs in favor of tree-based models such as LightGBM and CatBoost for house price prediction tasks.
- Keywords
- Graph neural networks, Graph construction, Real estate, Gradient boosted trees
Downloads
- (...).pdf
- full text (Published version)
- UGent only
- 2.79 MB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-01JPMMGM8GKZWY3MWNF14YR6EH
- MLA
- Geerts, Margot, et al. “Graph Neural Networks for House Price Prediction : Do or Don’t?” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, doi:10.1007/s41060-024-00682-y.
- APA
- Geerts, M., vanden Broucke, S., & De Weerdt, J. (2024). Graph neural networks for house price prediction : do or don’t? INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. https://doi.org/10.1007/s41060-024-00682-y
- Chicago author-date
- Geerts, Margot, Seppe vanden Broucke, and Jochen De Weerdt. 2024. “Graph Neural Networks for House Price Prediction : Do or Don’t?” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. https://doi.org/10.1007/s41060-024-00682-y.
- Chicago author-date (all authors)
- Geerts, Margot, Seppe vanden Broucke, and Jochen De Weerdt. 2024. “Graph Neural Networks for House Price Prediction : Do or Don’t?” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. doi:10.1007/s41060-024-00682-y.
- Vancouver
- 1. Geerts M, vanden Broucke S, De Weerdt J. Graph neural networks for house price prediction : do or don’t? INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. 2024;
- IEEE
- [1] M. Geerts, S. vanden Broucke, and J. De Weerdt, “Graph neural networks for house price prediction : do or don’t?,” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024.
@article{01JPMMGM8GKZWY3MWNF14YR6EH,
  abstract     = {{The domain of house price prediction, also referred to as real estate appraisal, has recently seen a shift from traditional statistical methodologies toward machine learning and deep learning techniques. As housing data is characterized by heterogeneous tabular data, and is subject to spatial dependencies, there is an exigent need for predictive models capable of capturing these complexities. Specifically, graph neural networks (GNNs) have been posited to discern spatial relationships by structuring housing data as graphs. Nevertheless, recent approaches frequently neglect alternative methods for graph construction, or lack a systematic comparative framework for different GNN approaches. Moreover, tree-based models, which are considered the state-of-the-art for tabular data, along with other contemporary methods, are often overlooked when evaluating GNNs. Therefore, this paper performs a comprehensive benchmark of graph construction methods and prevalent GNN models. Furthermore, we compare GNN approaches for house price prediction against an extensive suite of statistical, machine learning, and deep learning models. The results, drawn from six diverse housing datasets, reveal that GNNs are unsuccessful in surpassing machine learning and deep learning baselines. In particular, optimizing the graph structure yields only marginal improvements, with k-nearest neighbor graphs generally exhibiting superior performance. Among the GNN architectures evaluated, GraphSAGE and Transformer-based models demonstrate superior accuracy compared to other GNN variants. Ultimately, the findings suggest a general recommendation against the adoption of GNNs in favor of tree-based models such as LightGBM and CatBoost for house price prediction tasks.}},
  author       = {{Geerts, Margot and vanden Broucke, Seppe and De Weerdt, Jochen}},
  issn         = {{2364-415X}},
  journal      = {{INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS}},
  keywords     = {{Graph neural networks,Graph construction,Real estate,Gradient boosted trees}},
  language     = {{eng}},
  pages        = {{31}},
  title        = {{Graph neural networks for house price prediction : do or don't?}},
  url          = {{http://doi.org/10.1007/s41060-024-00682-y}},
  year         = {{2024}},
}