61 to 70 of 240 Results
May 31, 2023
De Latte, Fien, 2023, "Replication Data for: Los vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual", https://doi.org/10.18710/CMOIVT, DataverseNO, V1
Dataset abstract: The two data files in this dataset contain the annotated data used to conduct the Apparent-Time and micro-diachronic analysis presented in the paper "Vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual". The first data file contains 1107 tokens of a carefully selected set of vocatives, i...
Nov 22, 2023
Zhamaletdinova, Elmira, 2023, "Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’", https://doi.org/10.18710/MOJBDK, DataverseNO, V1
Dataset description: This is a study of examples of Russian impersonal constructions with the modal word možno ‘can, be possible’ with and without the future copula budet ‘will be,’ i.e., možno + budet + INF and možno + INF. The data was collected in 2020-2021 from the old version of the Russian National Corpus (ruscorpora.ru). In the spreadsheet 0...
Oct 24, 2023
Sönning, Lukas, 2023, "Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling from hierarchically structured corpus data", https://doi.org/10.18710/5KCE4U, DataverseNO, V1
Dataset description: This dataset, which is adapted from Jenset and McGillivray (2017), contains tabular files documenting the alternating usage of -(e)th and -(e)s to mark third-person verb inflection in Early Modern English. The data provided by Jenset and McGillivray (2017) are drawn from the PPCEME corpus (Kroch et al. 2004) and cover the period...
Sep 13, 2024
Van Hulle, Sven, 2024, "Replication Data for: The many guises of productivity: a case-study of Spanish inchoative constructions", https://doi.org/10.18710/5E8I0T, DataverseNO, V1
The dataset contains the quantitative data used as input for the Principal Components Analysis conducted in the article "The many guises of productivity: a case-study of Spanish inchoative constructions". The data originates from the Spanish Web Corpus (esTenTen18), accessed via Sketch Engine (Kilgarriff & Renau 2013). Only the subcorpus for Europea...
Jun 18, 2025
Vanhaverbeke, Margot; Enghels, Renata; Parafita Couto, M. Carmen; Ivanova, Iva, 2025, "Supporting Data for: Enhancing code-switching research through comparable corpora: Introducing the El Paso Bilingual Corpus", https://doi.org/10.18710/7LGSXY, DataverseNO, V1
Dataset description: This dataset contains two data files that the related publication is based on. In particular, the data file Dataset_Diminutives contains in total 1886 diminutive constructions extracted from the Bangor Miami Corpus and the El Paso Bilingual Corpus. These constructions are coded for intralinguistic variables relating to the ling...
Jun 18, 2014
Gerstenberger, Ciprian-Virgil, 2014, "Romanian Weak Pronoun Choice Data", https://doi.org/10.18710/GSV27M, DataverseNO, V1
The following corpus study shows that soft linguistic constraints are hard to describe and operationalize. In specific contexts, some Romanian clitic pronouns allow a choice between phonological hosts such as in că-mi dai cartea vs. că îmi dai cartea both meaning [that you give me the book]. What determines the choice between subjunction că in că-m...
Mar 29, 2016
Endresen, Anna; Janda, Laura A.; Reynolds, Robert; Tyers, Francis M., 2016, "Replication data for: Who needs particles? A challenge to the classification of particles as a part of speech in Russian", https://doi.org/10.18710/700FNV, DataverseNO, V1
In 1985, Zwicky argued that “particle” is a pretheoretical notion that should be eliminated from linguistic analysis. We propose a reclassification of Russian particles that implements Zwicky’s directive. Russian particles lack a coherent conceptual basis as a category and many are ambiguous with respect to part of speech. Our corpus analysis of Ru...
Apr 5, 2016
Nesset, Tore, 2016, "Replication data for: Spøkelsesfiske, makrellfotball og traktoregg: norske sammensetninger og konseptuell integrasjon", https://doi.org/10.18710/R4E1EW, DataverseNO, V1
This database is part of a study of Norwegian compounds from the perspective of cognitive linguistics and conceptual integration published in the Norwegian journal Maal og Minne. The database contains a large number of compounds based on the word fiske ‘fishing’. Here is some information about the study from the article: "Denne artikkelen presenter...
Oct 30, 2018
Cvrček, Václav, 2018, "Multi-Dimensional Analysis of Czech", https://doi.org/10.18710/QAJKZW, DataverseNO, V1, UNF:6:5rqhrfGF8iJspOAQER3OCA== [fileUNF]
Original data for a general-purpose multi-dimensional analysis model of register variation in Czech. This post contains a CSV data set of 137 linguistic features measured on 3428 Czech text chunks, and an R script which performs a factor analysis on this data set. The results of this factor analysis were used as a basis for an 8-dimensional model o...
Mar 11, 2024
Lewandowski, Wojciech, 2018, "Replication data for: Constructions are not predictable but are motivated: evidence from the Spanish completive reflexive", https://doi.org/10.18710/4QHOBK, DataverseNO, V2, UNF:6:tokJXAhE3MEy0uanXSF5aQ== [fileUNF]
Many researchers seem to think that construction grammar posits the existence of just wholly idiosyncratic constructions or form-meaning pairings. However, this idea demonstrates a deep misunderstanding of the approach, since constructions rarely emerge sui generis. Rather, construction grammar aims to balance the fact that some linguistic uses can...