The Programming of Algebra

Fritz Henglein
(University of Copenhagen)
Robin Kaarsgaard
(University of Edinburgh)
Mikkel Kragh Mathiesen
(University of Copenhagen)

We present module theory and linear maps as a powerful generalised and computationally efficient framework for the relational data model, which underpins today's relational database systems. Based on universal constructions of modules we obtain compact and computationally efficient data structures for data collections corresponding to union and deletion, repeated union, Cartesian product and key-indexed data. Free modules naturally give rise to polysets, which generalise multisets and facilitate expressing database queries as multilinear maps with asymptotically efficient evaluation on polyset constructors. We introduce compact maps as a way of representing infinite (poly)sets con- structible from an infinite base set and its elements by addition and subtraction. We show how natural joins generalise to algebraic joins, while intersection is implemented by a novel algorithm on nested compact maps that carefully avoids visiting parts of the input that do not contribute to the eventual output. Our algebraic framework leads to a worst-case optimal evaluation of cyclic relational queries, which is known to be impossible using textbook query optimisers that operate on lists of records only.

In Jeremy Gibbons and Max S. New: Proceedings Ninth Workshop on Mathematically Structured Functional Programming (MSFP 2022), Munich, Germany, 2nd April 2022, Electronic Proceedings in Theoretical Computer Science 360, pp. 71–92.
Published: 30th June 2022.

ArXived at: https://dx.doi.org/10.4204/EPTCS.360.4 bibtex PDF
References in reconstructed bibtex, XML and HTML format (approximated).
Comments and questions to: eptcs@eptcs.org
For website issues: webmaster@eptcs.org