Replication Data for: Russian verbal borrowings in Udmurthttps://doi.org/10.18710/5N34CGArkhangelskiy, TimofeyDataverseNO2019-05-142023-09-28T19:44:22ZThis is the dataset used in a study of Russian verbal loans in Udmurt. The files contain lists of Russian verbs found in the Udmurt social media corpus (http://udmurt.web-corpora.net/index_en.html), manually annotated for several features such as aspect or frequencies in different corpora.Abstract: In Udmurt, a Uralic language that has experienced long and extensive contact with the dominant Russian language, all four typologically relevant strategies of verbal borrowing are attested. This is unusual both cross-linguistically and for the Uralic family. The paper investigates these strategies and the factors that govern their choice. It turns out that, although free variation plays a major role in the distribution of strategies, there are also several important morphological, stylistic and areal factors. By analyzing these factors and the available historical data, I propose a diachronic explanation of the currently observed distribution. The study is mostly based on corpus data collected from contemporary Udmurt-language social media.Arts and Humanitieslanguage contactverbal borrowingsUdmurtRussiansocial mediacorpusEnglishArkhangelskiy, Timofey. "Russian verbal borrowings in Udmurt" Folia Linguistica, vol. 53, no. 2, 2019, pp. 519-552. https://doi.org/10.1515/flin-2019-2019, doi, 10.1515/flin-2019-2019, https://doi.org/10.1515/flin-2019-20192018-03-22Arkhangelskiy, Timofey2019-05-102007-01-012018-02-282017-12-012018-03-22corpus dataThe data were extracted from the corpus of Udmurt-language social media (http://udmurt.web-corpora.net/index_en.html). More information about the corpus and the kind of data it contains can be found in the following paper:
Arkhangelskiy, Timofey & Ekaterina Georgieva. 2018. Sound-aligned corpus of Udmurt dialectal texts. In: Pirinen, Tommi A. (ed.), Proceedings of the 4th International Workshop for Computational Linguistics for Uralic Languages (IWCLUL 2018), 26–38. Stroudsburg (PA): Association for Computational Linguistics.Russian FederationUdmurtiaTatarstanRussian FederationRussian FederationBashkortostanCC0 1.0