Advanced search
1 file | 2.79 MB Add to list

Graph neural networks for house price prediction : do or don't?

Author
Organization
Abstract
The domain of house price prediction, also referred to as real estate appraisal, has recently seen a shift from traditional statistical methodologies toward machine learning and deep learning techniques. As housing data is characterized by heterogeneous tabular data, and is subject to spatial dependencies, there is an exigent need for predictive models capable of capturing these complexities. Specifically, graph neural networks (GNNs) have been posited to discern spatial relationships by structuring housing data as graphs. Nevertheless, recent approaches frequently neglect alternative methods for graph construction, or lack a systematic comparative framework for different GNN approaches. Moreover, tree-based models, which are considered the state-of-the-art for tabular data, along with other contemporary methods, are often overlooked when evaluating GNNs. Therefore, this paper performs a comprehensive benchmark of graph construction methods and prevalent GNN models. Furthermore, we compare GNN approaches for house price prediction against an extensive suite of statistical, machine learning, and deep learning models. The results, drawn from six diverse housing datasets, reveal that GNNs are unsuccessful in surpassing machine learning and deep learning baselines. In particular, optimizing the graph structure yields only marginal improvements, with k-nearest neighbor graphs generally exhibiting superior performance. Among the GNN architectures evaluated, GraphSAGE and Transformer-based models demonstrate superior accuracy compared to other GNN variants. Ultimately, the findings suggest a general recommendation against the adoption of GNNs in favor of tree-based models such as LightGBM and CatBoost for house price prediction tasks.
Keywords
Graph neural networks, Graph construction, Real estate, Gradient boosted trees

Downloads

  • (...).pdf
    • full text (Published version)
    • |
    • UGent only
    • |
    • PDF
    • |
    • 2.79 MB

Citation

Please use this url to cite or link to this publication:

MLA
Geerts, Margot, et al. “Graph Neural Networks for House Price Prediction : Do or Don’t?” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, doi:10.1007/s41060-024-00682-y.
APA
Geerts, M., vanden Broucke, S., & De Weerdt, J. (2024). Graph neural networks for house price prediction : do or don’t? INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. https://doi.org/10.1007/s41060-024-00682-y
Chicago author-date
Geerts, Margot, Seppe vanden Broucke, and Jochen De Weerdt. 2024. “Graph Neural Networks for House Price Prediction : Do or Don’t?” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. https://doi.org/10.1007/s41060-024-00682-y.
Chicago author-date (all authors)
Geerts, Margot, Seppe vanden Broucke, and Jochen De Weerdt. 2024. “Graph Neural Networks for House Price Prediction : Do or Don’t?” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. doi:10.1007/s41060-024-00682-y.
Vancouver
1.
Geerts M, vanden Broucke S, De Weerdt J. Graph neural networks for house price prediction : do or don’t? INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS. 2024;
IEEE
[1]
M. Geerts, S. vanden Broucke, and J. De Weerdt, “Graph neural networks for house price prediction : do or don’t?,” INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024.
@article{01JPMMGM8GKZWY3MWNF14YR6EH,
  abstract     = {{The domain of house price prediction, also referred to as real estate appraisal, has recently seen a shift from traditional statistical methodologies toward machine learning and deep learning techniques. As housing data is characterized by heterogeneous tabular data, and is subject to spatial dependencies, there is an exigent need for predictive models capable of capturing these complexities. Specifically, graph neural networks (GNNs) have been posited to discern spatial relationships by structuring housing data as graphs. Nevertheless, recent approaches frequently neglect alternative methods for graph construction, or lack a systematic comparative framework for different GNN approaches. Moreover, tree-based models, which are considered the state-of-the-art for tabular data, along with other contemporary methods, are often overlooked when evaluating GNNs. Therefore, this paper performs a comprehensive benchmark of graph construction methods and prevalent GNN models. Furthermore, we compare GNN approaches for house price prediction against an extensive suite of statistical, machine learning, and deep learning models. The results, drawn from six diverse housing datasets, reveal that GNNs are unsuccessful in surpassing machine learning and deep learning baselines. In particular, optimizing the graph structure yields only marginal improvements, with k-nearest neighbor graphs generally exhibiting superior performance. Among the GNN architectures evaluated, GraphSAGE and Transformer-based models demonstrate superior accuracy compared to other GNN variants. Ultimately, the findings suggest a general recommendation against the adoption of GNNs in favor of tree-based models such as LightGBM and CatBoost for house price prediction tasks.}},
  author       = {{Geerts, Margot and vanden Broucke, Seppe and De Weerdt, Jochen}},
  issn         = {{2364-415X}},
  journal      = {{INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS}},
  keywords     = {{Graph neural networks,Graph construction,Real estate,Gradient boosted trees}},
  language     = {{eng}},
  pages        = {{31}},
  title        = {{Graph neural networks for house price prediction : do or don't?}},
  url          = {{http://doi.org/10.1007/s41060-024-00682-y}},
  year         = {{2024}},
}

Altmetric
View in Altmetric