Language-independence of DisCoCirc's Text Circuits: English and Urdu

Muhammad Hamza Waseem
Jonathon Liu
Vincent Wang-Maścianica
Bob Coecke

DisCoCirc is a newly proposed framework for representing the grammar and semantics of texts using compositional, generative circuits. While it constitutes a development of the Categorical Distributional Compositional (DisCoCat) framework, it exposes radically new features. In particular, [14] suggested that DisCoCirc goes some way toward eliminating grammatical differences between languages. In this paper we sketch an argument that this is indeed the case for restricted fragments of English and Urdu. We first develop DisCoCirc for a fragment of Urdu, as was done for English in [14]. There is a simple translation from English grammar to Urdu grammar, and vice versa. We then show that differences in grammatical structure between English and Urdu, primarily relating to the ordering of words and phrases, vanish when passing to DisCoCirc circuits.

In Michael Moortgat and Gijs Wijnholds: Proceedings End-to-End Compositional Models of Vector-Based Semantics (E2ECOMPVEC), NUI Galway, 15-16 August 2022, Electronic Proceedings in Theoretical Computer Science 366, pp. 50–60.
Published: 10th August 2022.

ArXived at: https://dx.doi.org/10.4204/EPTCS.366.7