Feb 24, 2025
Janda, Laura Alexis, 2025, "Replication Data for: Going Beyond Words: Engaging Grammar for Insights into Political Discourse", https://doi.org/10.18710/941IQJ, DataverseNO, V1
Dataset description: This dataset contains data in connection with a selection of three of Putin's speeches from 2023 and 2024. The related book chapter also includes analysis of data from Putin's speeches in 2022, and that data is available here: Obukhova, A. (2022). Replication Data for: the Case for Case in Putin’s Speeches. https://doi.org/10.1...
Feb 20, 2025
Nesset, Tore, 2025, "Russian pora ‘time’ vs. vremja ‘time’", https://doi.org/10.18710/5NIX4N, DataverseNO, V1
In order to shed light on the distribution of the Russian nouns pora and vremja, both of which mean ‘time’, I created a database of examples of both nouns from the Russian National Corpus (RNC, syntactic subcorpus). The database comprises all examples where pora or vremja function as a grammatical subject, a grammatical object, or as an adverbial r...
Feb 4, 2025
Janda, Laura Alexis, 2025, "Replication Data for: Contextually determined or semantically distinct? The competition between instrumental, long form nominative and short form nominative in Russian predicate adjectives", https://doi.org/10.18710/ZTQURH, DataverseNO, V1
Dataset description: This post provides the data and R scripts for analysis of data on the variation between long form nominative, short form nominative, and instrumental case in Russian predicate adjectives in sentences containing an overt copula verb. We analyze the various factors associated with the choice of form of the adjective. This is the a...
Jan 16, 2025
Enghels, Renata; Jansegers, Marlies; Van Den Driessche, Nele, 2025, "Replication Data for: Reflexiones metodológicas y teóricas sobre el análisis de marcadores pragmáticos: ilustraciones a través del estudio de «es que»", https://doi.org/10.18710/FKG7YX, DataverseNO, V1
Dataset abstract: This dataset contains one data file used to create the graphs and tables in the paper "Reflexiones metodológicas y teóricas sobre el análisis de marcadores pragmáticos: ilustraciones a través del estudio de «es que»". It includes 200 tokens of the pragmatic marker es que. These were retrieved from CORMA, a conversational corpus of...
Jan 15, 2025
Hampe, Beate; Gries, Stefan Th., 2025, "Replication Data for: Syntax from and for discourse II: More on complex sentences as meso-constructions", https://doi.org/10.18710/SIPOUV, DataverseNO, V1
Dataset abstract: The corpus files employed are a subset of 812 files containing spoken language from the British National Corpus (World edition, Oct. 2000) capturing British English in the late 20th century. For a description of the corpus, see http://www.natcorp.ox.ac.uk/archive/worldURG/index.xml. A total of 740 files were chosen because their m...
Jan 9, 2025
Verbeke, Gil; Mitterer, Holger; Simon, Ellen, 2025, "Replication Data for: Phonetic reduction in native and non-native English speech: Assessing the intelligibility for L2 listeners", https://doi.org/10.18710/OHP3O3, DataverseNO, V1
Dataset abstract: This dataset contains the results from 40 L1 British English, 80 Belgian Dutch and 80 European Spanish listeners, who were exposed to English speakers with a General British English, Newcastle and French accent. In the first experiment, participants completed (i) a demographic and linguistic background questionnaire, (ii) an orthog...
Jan 7, 2025
Sönning, Lukas, 2025, "Biber et al.'s (2016) set of 150 BNC items for the analysis of dispersion measures: Dataset for "Evaluation of text-level measures of lexical dispersion"", https://doi.org/10.18710/ATCQZW, DataverseNO, V1
This dataset contains frequencies for a set of 150 word forms in the BNC. The set of items was compiled by Biber et al. (2016) for the purpose of analyzing the behavior of dispersion measures in different distributional settings. It was therefore assembled to cover a broad range of frequency and dispersion levels. For each form, the dataset lists (...
Nov 26, 2024
Sönning, Lukas, 2024, "Background data for: Advancing our understanding of dispersion measures in corpus research", https://doi.org/10.18710/FVHTFM, DataverseNO, V1
Dataset description: This dataset contains background data and supplementary material for Sönning (forthcoming), a study that looks at the behavior of dispersion measures when applied to text-level frequency data. For the literature survey reported in that study, which examines how dispersion measures are used in corpus-based work, it includes tabul...
Nov 25, 2024
Sönning, Lukas, 2024, "Background data for: Some obstacles to replication in corpus linguistics", https://doi.org/10.18710/7LNWJX, DataverseNO, V1
This dataset contains tabular files recording occurrences and frequencies of modal verbs in the Brown family corpora; nine modal verbs (can, could, may, might, must, shall, should, will, would) and six corpora are considered (Brown, LOB, Frown, FLOB, BE06, AmE06). Tokens were retrieved using the CQPweb interface provided by the University of Lancas...
Nov 21, 2024
De Haes, Hanna; Lauwers, Peter; Simon, Ellen, 2024, "Replication Data for: L'acquisition des voyelles nasales en français : une étude acoustique et perceptive sur la prononciation des apprenants néerlandophones belges", https://doi.org/10.18710/JYS0Z1, DataverseNO, V1
Dataset abstract: This dataset contains two types of data on the production accuracy of French nasal vowels realized by L1 Belgian Dutch learners, i.e. listener-based and acoustic measures. By focusing on these two measures, we shed light on two different dimensions of production accuracy, i.e. vowel intelligibility and phonetic nativelikeness. Firs...
