To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Valid Text-to-SQL Generation with Unification-Based DeepStochLog
KU Leuven, Department of Computer Science, Leuven.AI, Leuven, Belgium.
Örebro University, School of Science and Technology. KU Leuven, Department of Computer Science, Leuven.AI, Leuven, Belgium. (Centre for Applied Autonomous Sensor Systems (AASS))ORCID iD: 0000-0002-6860-6303
KU Leuven, Department of Computer Science, Leuven.AI, Leuven, Belgium.
2024 (English)In: Neural-Symbolic Learning and Reasoning: 18th International Conference, NeSy 2024, Barcelona, Spain, September 9–12, 2024, Proceedings, Part I / [ed] Tarek R. Besold; Artur d’Avila Garcez; Ernesto Jimenez-Ruiz; Roberto Confalonieri; Pranava Madhyastha; Benedikt Wagner, Springer, 2024, Vol. 14979, p. 312-330Conference paper, Published paper (Refereed)
Abstract [en]

Large language models have been used to translate natural language questions to SQL queries. Without hard constraints on syntax and database schema, they occasionally produce invalid queries that are not executable. These failures limit the usage of these systems in real-life scenarios. We propose a neurosymbolic framework that imposes SQL syntax and schema constraints with unification-based definite clause grammars and thus guarantees the generation of valid queries. Our framework also builds a bi-directional interface to language models to leverage their natural language understanding abilities. The evaluation results on a subset of SQL grammars show that all our output queries are valid. This work is the first step towards extending language models with unificationbased grammars. We demonstrate this extension enhances the validity, execution accuracy, and ground truth alignment of the underlying language model by a large margin. Our code is available at https://github. com/ML- KULeuven/deepstochlog- lm.

Place, publisher, year, edition, pages
Springer, 2024. Vol. 14979, p. 312-330
Series
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349 ; Vol. 14979
Keywords [en]
Generative neurosymbolic, Language models, DeepStochLog, Text-to-SQL
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:oru:diva-117380DOI: 10.1007/978-3-031-71167-1_17ISI: 001329993500017Scopus ID: 2-s2.0-85204625517ISBN: 9783031711664 (print)ISBN: 9783031711671 (electronic)OAI: oai:DiVA.org:oru-117380DiVA, id: diva2:1913956
Conference
18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy 2024), Barcelona, Spain, September 9-12, 2024
Available from: 2024-11-18 Created: 2024-11-18 Last updated: 2024-11-18Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

De Raedt, Luc

Search in DiVA

By author/editor
De Raedt, Luc
By organisation
School of Science and Technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 76 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf