Ranked TilingShow others and affiliations
2014 (English)In: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2014, Nancy, France, September 15-19, 2014. Proceedings, Part II / [ed] Toon Calders; Floriana Esposito; Eyke Hüllermeier; Rosa Meo, Springer, 2014, p. 98-113Conference paper, Published paper (Refereed)
Abstract [en]
Tiling is a well-known pattern mining technique. Traditionally, it discovers large areas of ones in binary databases or matrices, where an area is defined by a set of rows and a set of columns. In this paper, we introduce the novel problem of ranked tiling, which is concerned with finding interesting areas in ranked data. In this data, each transaction defines a complete ranking of the columns. Ranked data occurs naturally in applications like sports or other competitions. It is also a useful abstraction when dealing with numeric data in which the rows are incomparable.
We introduce a scoring function for ranked tiling, as well as an algorithm using constraint programming and optimization principles. We empirically evaluate the approach on both synthetic and real-life datasets, and demonstrate the applicability of the framework in several case studies. One case study involves a heterogeneous dataset concerning the discovery of biomarkers for different subtypes of breast cancer patients. An analysis of the tiles by a domain expert shows that our approach can lead to the discovery of novel insights.
Place, publisher, year, edition, pages
Springer, 2014. p. 98-113
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 8725
Keywords [en]
tiling, ranked data, numerical data, pattern mining
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:oru:diva-92400DOI: 10.1007/978-3-662-44851-9_7Scopus ID: 2-s2.0-84907046858ISBN: 9783662448519 (electronic)ISBN: 9783662448502 (print)OAI: oai:DiVA.org:oru-92400DiVA, id: diva2:1567944
Conference
European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2014), Nancy, France, September 15-19, 2014
2021-06-172021-06-172021-06-17Bibliographically approved