Relational Regularization and Feature Ranking
2014 (English)In: Proceedings of the 2014 SIAM International Conference on Data Mining (SDM) / [ed] M. Zaki; Z. Obradovic; P. Ning Tan; A. Banerjee; C. Kamath; S. Parthasarathy, Society for Industrial and Applied Mathematics Publications , 2014, Vol. 2, p. 650-658Conference paper, Published paper (Refereed)
Abstract [en]
Regularization is one of the key concepts in machine learning, but so far it has received only little attention in the logical and relational learning setting. Here we propose a regularization and feature selection technique for such setting, in which one commonly represents the structure of the domain using an entity-relationship model. To this end, we introduce a notion of locality that ties together features according to their proximity in a transformed representation of the relational learning problem obtained via a procedure that we call “graphicalization”. We present two techniques, a wrapper and an efficient embedded approach, to identify the most relevant sets of predicates which yields more readily interpretable results than selecting low-level propositionalized features. The proposed techniques are implemented in the kernel-based relational learner kLog, although the ideas presented here can also be adapted to other relational learning frameworks. We evaluate our approach on classification tasks in the natural language processing and bioinformatics domain.
Place, publisher, year, edition, pages
Society for Industrial and Applied Mathematics Publications , 2014. Vol. 2, p. 650-658
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:oru:diva-92320DOI: 10.1137/1.9781611973440.75Scopus ID: 2-s2.0-84933075459ISBN: 9781611973440 (electronic)ISBN: 9781510811515 (print)OAI: oai:DiVA.org:oru-92320DiVA, id: diva2:1565310
Conference
14th SIAM International Conference on Data Mining (SDM 2014), Philadephia, Pennsylvania, USA, April 24-26, 2014
Funder
EU, FP7, Seventh Framework Programme, 255951
Note
Funding Agencies:
German Research Foundation (DFG-grant BA 2168/3-1)
Fonds Wetenschappelijk Onderzoek G.0478.10
Deutsche Forschungsgemeinschaft
2021-06-142021-06-142021-06-15Bibliographically approved