TY - GEN
T1 - Model Selection Using Graph Neural Networks
AU - Nápoles, Gonzalo
AU - Grau, Isel
AU - Güven, Çiçek
AU - Salgueiro, Yamisleydi
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - This paper tackles the problem of selecting the optimal models (algorithms and their hyperparameters) for a structured classification problem using Graph Neural Networks (GNNs). Recent efforts in this direction associate statistical meta-features describing the problem with the performance of predefined models. However, the predictive power of these meta-features is insufficient while being expensive to compute. The approach presented in this paper encodes each problem as a granular knowledge graph where nodes denote prototypes, while edges capture their distance. Moreover, nodes are labeled with the most popular class in their neighborhood, and their quality is quantified with a purity score. The adjacency-based representations of these knowledge graphs establish positive arrows between close prototypes that belong to different decision classes. Therefore, solving the multilabel model selection problem consists of predicting the set of optimal models for a given dataset represented by its adjacency-based matrix knowledge graph. The results indicate that the proposed GNN-based meta-classifier can predict an optimal model for 92% of the datasets, suppressing the need to extract low-level features.
AB - This paper tackles the problem of selecting the optimal models (algorithms and their hyperparameters) for a structured classification problem using Graph Neural Networks (GNNs). Recent efforts in this direction associate statistical meta-features describing the problem with the performance of predefined models. However, the predictive power of these meta-features is insufficient while being expensive to compute. The approach presented in this paper encodes each problem as a granular knowledge graph where nodes denote prototypes, while edges capture their distance. Moreover, nodes are labeled with the most popular class in their neighborhood, and their quality is quantified with a purity score. The adjacency-based representations of these knowledge graphs establish positive arrows between close prototypes that belong to different decision classes. Therefore, solving the multilabel model selection problem consists of predicting the set of optimal models for a given dataset represented by its adjacency-based matrix knowledge graph. The results indicate that the proposed GNN-based meta-classifier can predict an optimal model for 92% of the datasets, suppressing the need to extract low-level features.
KW - Graphical neural networks
KW - Meta-classifier
KW - Meta-features
KW - Multilabel model selection problem
UR - http://www.scopus.com/inward/record.url?scp=85201079296&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-66428-1_20
DO - 10.1007/978-3-031-66428-1_20
M3 - Conference contribution
AN - SCOPUS:85201079296
SN - 9783031664274
T3 - Lecture Notes in Networks and Systems
SP - 332
EP - 347
BT - Intelligent Systems and Applications - Proceedings of the 2024 Intelligent Systems Conference IntelliSys Volume 2
A2 - Arai, Kohei
PB - Springer Science and Business Media Deutschland GmbH
T2 - Intelligent Systems Conference, IntelliSys 2024
Y2 - 5 September 2024 through 6 September 2024
ER -