Ranked Tiling
2014 (English). In: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2014, Nancy, France, September 15-19, 2014. Proceedings, Part II / [ed] Toon Calders; Floriana Esposito; Eyke Hüllermeier; Rosa Meo, Springer, 2014, p. 98-113. Conference paper, Published paper (Refereed)
Abstract [en]
Tiling is a well-known pattern mining technique. Traditionally, it discovers large areas of ones in binary databases or matrices, where an area is defined by a set of rows and a set of columns. In this paper, we introduce the novel problem of ranked tiling, which is concerned with finding interesting areas in ranked data. In this data, each transaction defines a complete ranking of the columns. Ranked data occurs naturally in applications like sports or other competitions. It is also a useful abstraction when dealing with numeric data in which the rows are incomparable.
We introduce a scoring function for ranked tiling, as well as an algorithm using constraint programming and optimization principles. We empirically evaluate the approach on both synthetic and real-life datasets, and demonstrate the applicability of the framework in several case studies. One case study involves a heterogeneous dataset concerning the discovery of biomarkers for different subtypes of breast cancer patients. An analysis of the tiles by a domain expert shows that our approach can lead to the discovery of novel insights.
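The ranked-data abstraction described in the abstract can be illustrated with a small sketch. It converts each row of a numeric matrix into a complete ranking of its columns and then summarizes a candidate tile (a set of rows and a set of columns) by its mean rank. The function names and the sample matrix are hypothetical, and the mean rank is only a stand-in for illustration; it is not the scoring function introduced in the paper, which the abstract does not specify.

```python
def rank_row(values):
    """Return 1-based ranks for one row (1 = smallest value).

    Ties are broken by column order, which is an assumption of this sketch.
    """
    order = sorted(range(len(values)), key=lambda j: values[j])
    ranks = [0] * len(values)
    for position, column in enumerate(order, start=1):
        ranks[column] = position
    return ranks


def to_ranked(matrix):
    """Rank every row independently, so rows on different scales become comparable."""
    return [rank_row(row) for row in matrix]


def tile_mean_rank(ranked, rows, cols):
    """Mean rank inside the tile given by row indices and column indices.

    Illustrative summary only; not the paper's scoring function.
    """
    cells = [ranked[i][j] for i in rows for j in cols]
    return sum(cells) / len(cells)


# Hypothetical numeric data whose rows are on incomparable scales.
numeric = [
    [0.2, 15.0, 3.1, 9.9],
    [120.0, 900.0, 450.0, 30.0],
    [1.5, 7.5, 0.5, 6.0],
]
ranked = to_ranked(numeric)
tile_score = tile_mean_rank(ranked, rows=[0, 2], cols=[1, 3])
```

Ranking within each row discards the scale of the original values, which is why the abstraction suits numeric data whose rows cannot be compared directly.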
Place, publisher, year, edition, pages
Springer, 2014. p. 98-113
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 8725
Keywords [en]
tiling, ranked data, numerical data, pattern mining
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:oru:diva-92400
DOI: 10.1007/978-3-662-44851-9_7
Scopus ID: 2-s2.0-84907046858
ISBN: 9783662448519 (digital)
ISBN: 9783662448502 (print)
OAI: oai:DiVA.org:oru-92400
DiVA, id: diva2:1567944
Conference
European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2014), Nancy, France, September 15-19, 2014
2021-06-17; 2021-06-17; 2021-06-17. Bibliographically approved