Mar 4, 2025
Schriner, John, 2025, "Replication Data for: Predicting Stress in Russian using Modern Machine-Learning Tools", https://doi.org/10.18710/AAFCJP, DataverseNO, V1
This dataset consists of a TSV file with five columns of data originating in Zaliznyak's Grammar and Dictionary (1977). The data was programmatically scraped from Giella project data (Moshagen et al., 2013) by Spektor (2021). From Spektor (2021), the data was one of four sources...
Feb 26, 2025
O'Neill, Paul, 2025, "Replication Data for: Defective verbs in Portuguese: a morphomic approach", https://doi.org/10.18710/TVYCZL, DataverseNO, V1
This data is used in an article which provides evidence via corpus data and statistical methods that defective verbs in Portuguese constitute a psychological reality for speakers. It looks at the different explanations for defective verbs in Portuguese and argues that the morphom...
Feb 24, 2025
Janda, Laura Alexis, 2025, "Replication Data for: Going Beyond Words: Engaging Grammar for Insights into Political Discourse", https://doi.org/10.18710/941IQJ, DataverseNO, V1
Dataset description: This dataset contains data in connection with a selection of three of Putin's speeches from 2023 and 2024. The related book chapter also includes analysis of data from Putin's speeches in 2022, and that data is available here: Obukhova, A. (2022). Replication...
Feb 20, 2025
Nesset, Tore, 2025, "Russian pora ‘time’ vs. vremja ‘time’", https://doi.org/10.18710/5NIX4N, DataverseNO, V1
In order to shed light on the distribution of the Russian nouns pora and vremja, both of which mean ‘time’, I created a database of examples of both nouns from the Russian National Corpus (RNC, syntactic subcorpus). The database comprises all examples where pora or vremja functio...
Feb 4, 2025
Janda, Laura Alexis, 2025, "Replication Data for: Contextually determined or semantically distinct? The competition between instrumental, long form nominative and short form nominative in Russian predicate adjectives", https://doi.org/10.18710/ZTQURH, DataverseNO, V1
Dataset description: This post provides the data and R scripts for analysis of data on the variation between long form nominative, short form nominative, and instrumental case in Russian predicate adjectives in sentences containing an overt copula verb. We analyze the various fact...
Jan 16, 2025
Enghels, Renata; Jansegers, Marlies; Van Den Driessche, Nele, 2025, "Replication Data for: Reflexiones metodológicas y teóricas sobre el análisis de marcadores pragmáticos: ilustraciones a través del estudio de «es que»", https://doi.org/10.18710/FKG7YX, DataverseNO, V1
Dataset abstract: This dataset contains one data file used to create the graphs and tables in the paper "Reflexiones metodológicas y teóricas sobre el análisis de marcadores pragmáticos: ilustraciones a través del estudio de «es que»". It includes 200 tokens of the pragmatic marke...
Jan 15, 2025
Hampe, Beate; Gries, Stefan Th., 2025, "Replication Data for: Syntax from and for discourse II: More on complex sentences as meso-constructions", https://doi.org/10.18710/SIPOUV, DataverseNO, V1
Dataset abstract: The corpus files employed are a subset of 812 files containing spoken language from the British National Corpus (World edition, Oct. 2000) capturing British English in the late 20th century. For a description of the corpus, see http://www.natcorp.ox.ac.uk/archiv...
Jan 9, 2025
Verbeke, Gil; Mitterer, Holger; Simon, Ellen, 2025, "Replication Data for: Phonetic reduction in native and non-native English speech: Assessing the intelligibility for L2 listeners", https://doi.org/10.18710/OHP3O3, DataverseNO, V1
Dataset abstract: This dataset contains the results from 40 L1 British English, 80 Belgian Dutch and 80 European Spanish listeners, who were exposed to English speakers with a General British English, Newcastle and French accent. In the first experiment, participants completed (i)...
Jan 7, 2025
Sönning, Lukas, 2025, "Biber et al.'s (2016) set of 150 BNC items for the analysis of dispersion measures: Dataset for "Evaluation of text-level measures of lexical dispersion"", https://doi.org/10.18710/ATCQZW, DataverseNO, V1
This dataset contains frequencies for a set of 150 word forms in the BNC. The set of items was compiled by Biber et al. (2016) for the purpose of analyzing the behavior of dispersion measures in different distributional settings. It was therefore assembled to cover a broad range...
Nov 26, 2024
Schützler, Ole, 2024, "Investigating rhoticity in Scottish Standard English with sociolinguistic interviews and corpus data: Auditory ratings", https://doi.org/10.18710/XKCCV1, DataverseNO, V1
The dataset is used in an article published in World Englishes. In the article, I test and discuss methodological and theoretical implications of combining classic sociolinguistic interview data with data from a spoken corpus. Thus, the data combined in this set stem from two sou...