May 31, 2023
De Latte, Fien, 2023, "Replication Data for: Los vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual", https://doi.org/10.18710/CMOIVT, DataverseNO, V1
Dataset abstract: The two data files in this dataset contain the annotated data used to conduct the Apparent-Time and micro-diachronic analysis presented in the paper "Vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual". The first data file contains 1107 tokens of a carefully selected set of vocatives, i... |
Nov 22, 2023
Zhamaletdinova, Elmira, 2023, "Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’", https://doi.org/10.18710/MOJBDK, DataverseNO, V1
Dataset description: This is a study of examples of Russian impersonal constructions with the modal word možno ‘can, be possible’ with and without the future copula budet ‘will be,’ i.e., možno + budet + INF and možno + INF. The data was collected in 2020-2021 from the old version of the Russian National Corpus (ruscorpora.ru). In the spreadsheet 0... |
Oct 24, 2023
Sönning, Lukas, 2023, "Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling from hierarchically structured corpus data", https://doi.org/10.18710/5KCE4U, DataverseNO, V1
Dataset description: This dataset, which is adapted from Jenset and McGillivray (2017), contains tabular files documenting the alternating usage of -(e)th and -(e)s to mark third-person verb inflection in Early Modern English. The data provided by Jenset and McGillivray (2017) are drawn from the PPCEME corpus (Kroch et al. 2004) and cover the period... |
Sep 13, 2024
Van Hulle, Sven, 2024, "Replication Data for: The many guises of productivity: a case-study of Spanish inchoative constructions", https://doi.org/10.18710/5E8I0T, DataverseNO, V1
The dataset contains the quantitative data used as input for the Principal Components Analysis conducted in the article "The many guises of productivity: a case-study of Spanish inchoative constructions". The data originates from the Spanish Web Corpus (esTenTen18), accessed via Sketch Engine (Kilgarriff & Renau 2013). Only the subcorpus for Europea... |
Jun 18, 2025
Vanhaverbeke, Margot; Enghels, Renata; Parafita Couto, M. Carmen; Ivanova, Iva, 2025, "Supporting Data for: Enhancing code-switching research through comparable corpora: Introducing the El Paso Bilingual Corpus", https://doi.org/10.18710/7LGSXY, DataverseNO, V1
Dataset description: This dataset contains two data files that the related publication is based on. In particular, the data file Dataset_Diminutives contains in total 1886 diminutive constructions extracted from the Bangor Miami Corpus and the El Paso Bilingual Corpus. These constructions are coded for intralinguistic variables relating to the ling... |
Jun 18, 2014
Gerstenberger, Ciprian-Virgil, 2014, "Romanian Weak Pronoun Choice Data", https://doi.org/10.18710/GSV27M, DataverseNO, V1
The following corpus study shows that soft linguistic constraints are hard to describe and operationalize. In specific contexts, some Romanian clitic pronouns allow a choice between phonological hosts such as in că-mi dai cartea vs. că îmi dai cartea both meaning [that you give me the book]. What determines the choice between subjunction că in că-m... |
Mar 29, 2016
Endresen, Anna; Janda, Laura A.; Reynolds, Robert; Tyers, Francis M., 2016, "Replication data for: Who needs particles? A challenge to the classification of particles as a part of speech in Russian", https://doi.org/10.18710/700FNV, DataverseNO, V1
In 1985, Zwicky argued that “particle” is a pretheoretical notion that should be eliminated from linguistic analysis. We propose a reclassification of Russian particles that implements Zwicky’s directive. Russian particles lack a coherent conceptual basis as a category and many are ambiguous with respect to part of speech. Our corpus analysis of Ru... |
Apr 5, 2016
Nesset, Tore, 2016, "Replication data for: Spøkelsesfiske, makrellfotball og traktoregg: norske sammensetninger og konseptuell integrasjon", https://doi.org/10.18710/R4E1EW, DataverseNO, V1
This database is part of a study of Norwegian compounds from the perspective of cognitive linguistics and conceptual integration published in the Norwegian journal Maal og Minne. The database contains a large number of compounds based on the word fiske ‘fishing’. Here is some information about the study from the article: "Denne artikkelen presenter... |
Oct 30, 2018
Cvrček, Václav, 2018, "Multi-Dimensional Analysis of Czech", https://doi.org/10.18710/QAJKZW, DataverseNO, V1, UNF:6:5rqhrfGF8iJspOAQER3OCA== [fileUNF]
Original data for a general-purpose multi-dimensional analysis model of register variation in Czech. This post contains a CSV data set of 137 linguistic features measured on 3428 Czech text chunks, and an R script which performs a factor analysis on this data set. The results of this factor analysis were used as a basis for an 8-dimensional model o... |
Mar 11, 2024
Lewandowski, Wojciech, 2018, "Replication data for: Constructions are not predictable but are motivated: evidence from the Spanish completive reflexive", https://doi.org/10.18710/4QHOBK, DataverseNO, V2, UNF:6:tokJXAhE3MEy0uanXSF5aQ== [fileUNF]
Many researchers seem to think that construction grammar posits the existence of just wholly idiosyncratic constructions or form-meaning pairings. However, this idea demonstrates a deep misunderstanding of the approach, since constructions rarely emerge sui generis. Rather, construction grammar aims to balance the fact that some linguistic uses can... |
