Core Concepts
Morphological means of expressing generic temporal subordination ('when'-clauses) are prevalent in many Latin American and Caribbean languages, posing challenges for token-based typological approaches. This study incorporates character n-gram analysis to capture such morphological markers alongside lexical subordinators, generating probabilistic semantic maps that reveal systematic cross-linguistic variation in the region.
Abstract
This paper explores the expression of generic temporal subordination ('when'-clauses) in Latin American and Caribbean languages, where morphological marking is particularly common in addition to lexical subordinators.
The study uses a massively parallel corpus of New Testament translations to generate probabilistic semantic maps of 'when'-clauses. It incorporates character n-gram analysis to capture morphological markers of temporal subordination, in addition to lexical subordinators.
The results show that the method is effective in identifying morphological means of expressing 'when'-clauses, such as switch-reference markers, in languages like Huichol and Quechuan varieties. The semantic maps reveal systematic variation in how languages in the region divide the semantic space of 'when', using a combination of lexical and morphological strategies.
Beyond switch-reference, the maps capture a range of other patterns, including languages that use only lexical subordinators, or those with an inverted distribution compared to Huichol, where the lexical means occupy the top of the map and the morphological means the bottom. The study highlights the value of incorporating character n-grams to complement token-based typological approaches, in order to better understand the diversity of strategies for encoding temporal subordination cross-linguistically.
Stats
"When they came to him he said to them" (Acts 20:18)
"When the men had come to him they said" (Luke 7:20)
"When he came down from the mountain, great crowds followed him" (Matthew 8:1)
Quotes
"Languages can encode temporal subordination lexically, via subordinating conjunctions, and morphologically, by marking the relation on the predicate."
"Systematic cross-linguistic variation among the former can be studied using well-established token-based typological approaches to token-aligned parallel corpora. Variation among different morphological means is instead much harder to tackle and therefore more poorly understood, despite being predominant in several language groups."