Replication Data for: Russian verbal borrowings in Udmurt (doi:10.18710/5N34CG)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description
Citation
Title:	Replication Data for: Russian verbal borrowings in Udmurt
Identification Number:	doi:10.18710/5N34CG
Distributor:	DataverseNO
Date of Distribution:	2019-05-14
Version:	1
Bibliographic Citation:	Arkhangelskiy, Timofey, 2019, "Replication Data for: Russian verbal borrowings in Udmurt", https://doi.org/10.18710/5N34CG, DataverseNO, V1
Study Description
Citation
Title:	Replication Data for: Russian verbal borrowings in Udmurt
Identification Number:	doi:10.18710/5N34CG
Authoring Entity:	Arkhangelskiy, Timofey (University of Hamburg)
Producer:	University of Hamburg
Date of Production:	2018-03-22
Distributor:	DataverseNO
Distributor:	The Tromsø Repository of Language and Linguistics (TROLLing)
Access Authority:	Arkhangelskiy, Timofey
Depositor:	Arkhangelskiy, Timofey
Date of Deposit:	2019-05-10
Holdings Information:	https://doi.org/10.18710/5N34CG
Study Scope
Keywords:	Arts and Humanities, language contact, verbal borrowings, Udmurt, Russian, social media, corpus
Abstract:	This is the dataset used in a study of Russian verbal loans in Udmurt. The files contain lists of Russian verbs found in the Udmurt social media corpus (http://udmurt.web-corpora.net/index_en.html), manually annotated for several features such as aspect or frequencies in different corpora.
	Abstract: In Udmurt, a Uralic language that has experienced long and extensive contact with the dominant Russian language, all four typologically relevant strategies of verbal borrowing are attested. This is unusual both cross-linguistically and for the Uralic family. The paper investigates these strategies and the factors that govern their choice. It turns out that, although free variation plays a major role in the distribution of strategies, there are also several important morphological, stylistic and areal factors. By analyzing these factors and the available historical data, I propose a diachronic explanation of the currently observed distribution. The study is mostly based on corpus data collected from contemporary Udmurt-language social media.
Time Period:	2007-01-01-2018-02-28
Date of Collection:	2017-12-01-2018-03-22
Country:	Russian Federation, Russian Federation, Russian Federation
Geographic Coverage:	Udmurtia, Tatarstan, Bashkortostan
Kind of Data:	corpus data
Methodology and Processing
Sources Statement
Data Sources:	The data were extracted from the corpus of Udmurt-language social media (http://udmurt.web-corpora.net/index_en.html). More information about the corpus and the kind of data it contains can be found in the following paper: Arkhangelskiy, Timofey & Ekaterina Georgieva. 2018. Sound-aligned corpus of Udmurt dialectal texts. In: Pirinen, Tommi A. (ed.), Proceedings of the 4th International Workshop for Computational Linguistics for Uralic Languages (IWCLUL 2018), 26–38. Stroudsburg (PA): Association for Computational Linguistics.
Data Access
Other Study Description Materials
Related Publications
Citation
Title:	Arkhangelskiy, Timofey. "Russian verbal borrowings in Udmurt" Folia Linguistica, vol. 53, no. 2, 2019, pp. 519-552. https://doi.org/10.1515/flin-2019-2019
Identification Number:	10.1515/flin-2019-2019
Bibliographic Citation:	Arkhangelskiy, Timofey. "Russian verbal borrowings in Udmurt" Folia Linguistica, vol. 53, no. 2, 2019, pp. 519-552. https://doi.org/10.1515/flin-2019-2019
Other Study-Related Materials
Label:	00_ReadMe.txt
Text:	The readme file.
Notes:	text/plain
Other Study-Related Materials
Label:	all_rus_verbs_in_udmurt.txt
Text:	List of lemmata of all Russian verbs borrowed in the Udmurt social media corpus, annotated for several features.
Notes:	text/plain
Other Study-Related Materials
Label:	ins_rus_verbs_occ.txt
Text:	List of word forms of all Russian verbs borrowed via Direct or Indirect insertion in the Udmurt social media corpus.
Notes:	text/plain