Till Örebro universitet

oru.seÖrebro universitets publikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Predictive spreadsheet autocompletion with constraints
KU Leuven, Leuven, Belgium.
KU Leuven, Leuven, Belgium.
KU Leuven, Leuven, Belgium.
KU Leuven, Leuven, Belgium.ORCID-id: 0000-0002-6860-6303
2020 (Engelska)Ingår i: Machine Learning, ISSN 0885-6125, E-ISSN 1573-0565, Vol. 109, nr 2, s. 307-325Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Spreadsheets are arguably the most accessible data-analysis tool and are used by millions of people. Despite the fact that they lie at the core of most business practices, working with spreadsheets can be error prone, usage of formulas requires training and, crucially, spreadsheet users do not have access to state-of-the-art analysis techniques offered by machine learning. To tackle these issues, we introduce the novel task of predictive spreadsheet autocompletion, where the goal is to automatically predict the missing entries in the spreadsheets. This task is highly non-trivial: cells can hold heterogeneous data types and there might be unobserved relationships between their values, such as constraints or probabilistic dependencies. Critically, the exact prediction task itself is not given. We consider a simplified, yet non-trivial, setting and propose a principled probabilistic model to solve it. Our approach combines black-box predictive models specialized for different predictive tasks (e.g., classification, regression) and constraints and formulas detected by a constraint learner, and produces a maximally likely prediction for all target cells that is consistent with the constraints. Overall, our approach brings us one step closer to allowing end users to leverage machine learning in their workflows without writing a single line of code.

Ort, förlag, år, upplaga, sidor
Springer-Verlag New York, 2020. Vol. 109, nr 2, s. 307-325
Nyckelord [en]
Spreadsheets Autocompletion, Bayesian Networks, Constraint Learning, Machine Learning
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:oru:diva-83315DOI: 10.1007/s10994-019-05841-yISI: 000492576000001Scopus ID: 2-s2.0-85074591152OAI: oai:DiVA.org:oru-83315DiVA, id: diva2:1442447
Anmärkning

Funding Agency:

European Research Council (ERC) 694980

Tillgänglig från: 2020-06-17 Skapad: 2020-06-17 Senast uppdaterad: 2020-11-19Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

De Raedt, Luc

Sök vidare i DiVA

Av författaren/redaktören
De Raedt, Luc
I samma tidskrift
Machine Learning
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 96 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf