Part 1: Document Description
|
Citation |
|
---|---|
Title: |
Replication Data for: Russian verbal borrowings in Udmurt |
Identification Number: |
doi:10.18710/5N34CG |
Distributor: |
DataverseNO |
Date of Distribution: |
2019-05-14 |
Version: |
1 |
Bibliographic Citation: |
Arkhangelskiy, Timofey, 2019, "Replication Data for: Russian verbal borrowings in Udmurt", https://doi.org/10.18710/5N34CG, DataverseNO, V1 |
Citation |
|
Title: |
Replication Data for: Russian verbal borrowings in Udmurt |
Identification Number: |
doi:10.18710/5N34CG |
Authoring Entity: |
Arkhangelskiy, Timofey (University of Hamburg) |
Producer: |
University of Hamburg |
Date of Production: |
2018-03-22 |
Distributor: |
DataverseNO |
Distributor: |
The Tromsø Repository of Language and Linguistics (TROLLing) |
Access Authority: |
Arkhangelskiy, Timofey |
Depositor: |
Arkhangelskiy, Timofey |
Date of Deposit: |
2019-05-10 |
Holdings Information: |
https://doi.org/10.18710/5N34CG |
Study Scope |
|
Keywords: |
Arts and Humanities, language contact, verbal borrowings, Udmurt, Russian, social media, corpus |
Abstract: |
This is the dataset used in a study of Russian verbal loans in Udmurt. The files contain lists of Russian verbs found in the Udmurt social media corpus (http://udmurt.web-corpora.net/index_en.html), manually annotated for several features such as aspect or frequencies in different corpora. |
Abstract: In Udmurt, a Uralic language that has experienced long and extensive contact with the dominant Russian language, all four typologically relevant strategies of verbal borrowing are attested. This is unusual both cross-linguistically and for the Uralic family. The paper investigates these strategies and the factors that govern their choice. It turns out that, although free variation plays a major role in the distribution of strategies, there are also several important morphological, stylistic and areal factors. By analyzing these factors and the available historical data, I propose a diachronic explanation of the currently observed distribution. The study is mostly based on corpus data collected from contemporary Udmurt-language social media. |
|
Time Period: |
2007-01-01 to 2018-02-28 |
Date of Collection: |
2017-12-01 to 2018-03-22 |
Country: |
Russian Federation |
Geographic Coverage: |
Udmurtia, Tatarstan, Bashkortostan |
Kind of Data: |
corpus data |
Methodology and Processing |
|
Sources Statement |
|
Data Sources: |
The data were extracted from the corpus of Udmurt-language social media (http://udmurt.web-corpora.net/index_en.html). More information about the corpus and the kind of data it contains can be found in the following paper: Arkhangelskiy, Timofey & Ekaterina Georgieva. 2018. Sound-aligned corpus of Udmurt dialectal texts. In: Pirinen, Tommi A. (ed.), Proceedings of the 4th International Workshop for Computational Linguistics for Uralic Languages (IWCLUL 2018), 26–38. Stroudsburg (PA): Association for Computational Linguistics. |
Data Access |
|
Other Study Description Materials |
|
Related Publications |
|
Citation |
|
Title: |
Arkhangelskiy, Timofey. "Russian verbal borrowings in Udmurt." Folia Linguistica, vol. 53, no. 2, 2019, pp. 519–552. https://doi.org/10.1515/flin-2019-2019 |
Identification Number: |
10.1515/flin-2019-2019 |
Bibliographic Citation: |
Arkhangelskiy, Timofey. "Russian verbal borrowings in Udmurt." Folia Linguistica, vol. 53, no. 2, 2019, pp. 519–552. https://doi.org/10.1515/flin-2019-2019 |
Label: |
00_ReadMe.txt |
Text: |
The readme file. |
Notes: |
text/plain |
Label: |
all_rus_verbs_in_udmurt.txt |
Text: |
List of lemmata of all Russian verbs borrowed in the Udmurt social media corpus, annotated for several features. |
Notes: |
text/plain |
Label: |
ins_rus_verbs_occ.txt |
Text: |
List of word forms of all Russian verbs borrowed via Direct or Indirect insertion in the Udmurt social media corpus. |
Notes: |
text/plain |