41 to 50 of 240 Results
Feb 22, 2022
Data collection of the UiT Aurora Center for Language Acquisition, Variation & Attrition: The Dynamic Nature of Languages in the Mind. AcqVA Aurora is a UiT Aurora Centre (2020-2024), part of a competitive scheme to strengthen promising research groups. AcqVA Aurora combines solid empirical work with advanced theoretical (and statistical) modeling... |
Sep 26, 2023
Nesset, Tore; Xavier, Kevin, 2023, "Replication Data for: From machine learning to classroom learning: mobile vowels and the Russian preposition v ‘in(to)’", https://doi.org/10.18710/ZCDX1B, DataverseNO, V1
The present study reports on a machine learning experiment concerning mobile vowels in the Russian preposition v ‘in(to)’. Data are extracted from the Russian National Corpus. It is shown that a neural network is able to predict mobile vowels in 97.4% of the cases in our dataset, and a decision tree is used to extract a set of three rules that a la... |
Nov 18, 2023
Zhukova, Valentina, 2023, "Replication Data for: How to threaten in Russian: a constructionist approach", https://doi.org/10.18710/ZFYCOG, DataverseNO, V1
This dataset concerns the data for the article that analyzes various linguistic means to carry out threats in Russian with a special focus on 27 constructions tagged as "Threat" in the Russian Constructicon, a linguistic repository of more than 2200 constructions in the Russian language. The major purpose of the study is to investigate what constit... |
May 31, 2023
De Latte, Fien, 2023, "Replication Data for: (Im)polite uses of vocatives in present-day Madrilenian Spanish", https://doi.org/10.18710/FOBMUQ, DataverseNO, V1
Dataset abstract: This dataset contains one datafile (.csv) used to create the graphs and tables in the paper "(Im)polite uses of vocatives in present-day Madrilenian Spanish". It includes 534 Spanish vocative tokens, i.e. (pro)nominal terms of direct address (e.g., tío 'dude'), which were retrieved from CORMA, a conversational corpus of peninsular... |
Mar 4, 2025
Schriner, John, 2025, "Replication Data for: Predicting Stress in Russian using Modern Machine-Learning Tools", https://doi.org/10.18710/AAFCJP, DataverseNO, V1
This dataset consists of a TSV file with five columns of data originating in Zaliznyak's Grammar and Dictionary (1977). The data was programmatically scraped from Giella project data (Moshagen et al., 2013) by Spektor (2021). From Spektor (2021), the data was one of four sources in their RusLex application. Once scraped from there, only symbols wer... |
Sep 15, 2023
Janda, Laura Alexis, 2023, "Sources and Targets in Kuteva et al. 2019", https://doi.org/10.18710/FYFNFV, DataverseNO, V1
This dataset is based on examples found in Kuteva et al. 2019: Kuteva, Tania, Bernd Heine, Bo Hong, Haiping Long, Heiko Narrog, and Seongha Rhee. 2019. World Lexicon of Gramaticalization (2nd ed.). Cambridge: Cambridge University Press. Kuteva et al.’s World Lexicon of Grammaticalization (2019) is an inventory of examples of morphological reanalysi... |
Nov 2, 2023
Endresen, Anna; Janda, Laura Alexis; Zhukova, Valentina, 2023, "Replication Data for: Typology of reduplication in Russian: constructions within and beyond a single clause", https://doi.org/10.18710/CYLJCD, DataverseNO, V1
We analyze repetition in Russian from the perspective of the Russian Constructicon which represents over 2200 grammatical constructions described in terms of anchors (fixed elements) and slots (for various filler elements) and fully annotated for their syntactic and semantic characteristics. The Russian Constructicon facilitates the first large-sca... |
Sep 13, 2018
Nesset, Tore, 2018, "Norwegian compounds and their Russian equivalents", https://doi.org/10.18710/0U0KN2, DataverseNO, V1
This post contains the dataset discussed in two related publications: Nesset, Tore (2018a): When a single word is enough: Norwegian compounds and their Russian counterparts. Slovo. http://www.moderna.uu.se/slaviska/slovo/ Nesset, Tore (2018b): How to translate compounds into Russian? Scando-Slavica 64.2. |
Jan 15, 2025
Hampe, Beate; Gries, Stefan Th., 2025, "Replication Data for: Syntax from and for discourse II: More on complex sentences as meso-constructions", https://doi.org/10.18710/SIPOUV, DataverseNO, V1
Dataset abstract: The corpus files employed are a subset of 812 files containing spoken language from the British National Corpus (World edition, Oct. 2000) capturing British English in the late 20th century. For a description of the corpus, see http://www.natcorp.ox.ac.uk/archive/worldURG/index.xml. A total of 740 files were chosen because their m... |
May 14, 2019
Arkhangelskiy, Timofey, 2019, "Replication Data for: Russian verbal borrowings in Udmurt", https://doi.org/10.18710/5N34CG, DataverseNO, V1
This is the dataset used in a study of Russian verbal loans in Udmurt. The files contain lists of Russian verbs found in the Udmurt social media corpus (http://udmurt.web-corpora.net/index_en.html), manually annotated for several features such as aspect or frequencies in different corpora. Abstract: In Udmurt, a Uralic language that has experienced... |
