To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning constraints in spreadsheets and tabular data
KU Leuven, Leuven, Belgium.
KU Leuven, Leuven, Belgium.
Vrije Universiteit Brussel, Brussels, Belgium.
KU Leuven, Leuven, Belgium.ORCID iD: 0000-0002-6860-6303
2017 (English)In: Machine Learning, ISSN 0885-6125, E-ISSN 1573-0565, no 106, p. 1441-1468Article in journal (Refereed) Published
Abstract [en]

Spreadsheets, comma separated value files and other tabular data representations are in wide use today. However, writing, maintaining and identifying good formulas for tabular data and spreadsheets can be time-consuming and error-prone. We investigate the automatic learning of constraints (formulas and relations) in raw tabular data in an unsupervised way. We represent common spreadsheet formulas and relations through predicates and expressions whose arguments must satisfy the inherent properties of the constraint. The challenge is to automatically infer the set of constraints present in the data, without labeled examples or user feedback. We propose a two-stage generate and test method where the first stage uses constraint solving techniques to efficiently reduce the number of candidates, based on the predicate signatures. Our approach takes inspiration from inductive logic programming, constraint learning and constraint satisfaction. We show that we are able to accurately discover constraints in spreadsheets from various sources.

Place, publisher, year, edition, pages
Springer, 2017. no 106, p. 1441-1468
Keywords [en]
Constraint learning, Tabular constraint learning, Spreadsheets, Excel, Constraint programming, Constraint discovery, Machine learning
National Category
Natural Sciences Computer Engineering
Identifiers
URN: urn:nbn:se:oru:diva-84382DOI: 10.1007/s10994-017-5640-xISI: 000409478700009Scopus ID: 2-s2.0-85020207644OAI: oai:DiVA.org:oru-84382DiVA, id: diva2:1451616
Available from: 2020-07-03 Created: 2020-07-03 Last updated: 2020-08-24Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

De Raedt, Luc

Search in DiVA

By author/editor
De Raedt, Luc
In the same journal
Machine Learning
Natural SciencesComputer Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 87 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf