TROLLing

Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

61 to 70 of 240 Results

Replication Data for: Los vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual May 31, 2023 De Latte, Fien, 2023, "Replication Data for: Los vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual", https://doi.org/10.18710/CMOIVT, DataverseNO, V1 Dataset abstract: The two data files in this dataset contain the annotated data used to conduct the Apparent-Time and micro-diachronic analysis presented in the paper "Vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual". The first data file contains 1107 tokens of a carefully selected set of vocatives, i...
Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’ Nov 22, 2023 Zhamaletdinova, Elmira, 2023, "Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’", https://doi.org/10.18710/MOJBDK, DataverseNO, V1 Dataset description: This is a study of examples of Russian impersonal constructions with the modal word možno ‘can, be possible’ with and without the future copula budet ‘will be,’ i.e., možno + budet + INF and možno + INF. The data was collected in 2020-2021 from the old version of the Russian National Corpus (ruscorpora.ru). In the spreadsheet 0...
Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling from hierarchically structured corpus data Oct 24, 2023 Sönning, Lukas, 2023, "Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling from hierarchically structured corpus data", https://doi.org/10.18710/5KCE4U, DataverseNO, V1 Dataset description This dataset, which is adapted from Jenset and McGillivray (2017), contains tabular files documenting the alternating usage of -(e)th and -(e)s to mark third-person verb inflection in Early Modern English. The data provided by Jenset and McGillivray (2017) are drawn from the PPCEME corpus (Kroch et al. 2004) and cover the period...
Replication Data for: The many guises of productivity: a case-study of Spanish inchoative constructions Sep 13, 2024 Van Hulle, Sven, 2024, "Replication Data for: The many guises of productivity: a case-study of Spanish inchoative constructions", https://doi.org/10.18710/5E8I0T, DataverseNO, V1 The dataset contains the quantitative data used as input for the Principal Components Analysis conducted in the article "The many guises of productivity: a case-study of Spanish inchoative constructions". The data originates from the Spanish Web Corpus (esTenTen18), accessed via Sketch Engine (Kilgariff & Renau 2013). Only the subcorpus for Europea...
Supporting Data for: Enhancing code-switching research through comparable corpora: Introducing the El Paso Bilingual Corpus Jun 18, 2025 Vanhaverbeke, Margot; Enghels, Renata; Parafita Couto, M. Carmen; Ivanova, Iva, 2025, "Supporting Data for: Enhancing code-switching research through comparable corpora: Introducing the El Paso Bilingual Corpus", https://doi.org/10.18710/7LGSXY, DataverseNO, V1 Dataset description: This dataset contains two data files that the related publication is based on. In particular, the data file Dataset_Diminutives contains in total 1886 diminutive constructions extracted from the Bangor Miami Corpus and the El Paso Bilingual Corpus. These constructions are coded for intralinguistic variables relating to the ling...
Romanian Weak Pronoun Choice Data Jun 18, 2014 Gerstenberger, Ciprian-Virgil, 2014, "Romanian Weak Pronoun Choice Data", https://doi.org/10.18710/GSV27M, DataverseNO, V1 The following corpus study shows that soft linguistic constraints are hard to describe and operationalize. In specific contexts, some Romanian clitic pronouns allow a choice between phonological hosts such as in că-mi dai cartea vs. că îmi dai cartea both meaning [that you give me the book]. What determines the choice between subjunction că in că-m...
Replication data for: Who needs particles? A challenge to the classification of particles as a part of speech in Russian Mar 29, 2016 Endresen, Anna; Janda, Laura A.; Reynolds, Robert; Tyers, Francis M., 2016, "Replication data for: Who needs particles? A challenge to the classification of particles as a part of speech in Russian", https://doi.org/10.18710/700FNV, DataverseNO, V1 In 1985, Zwicky argued that “particle” is a pretheoretical notion that should be eliminated from linguistic analysis. We propose a reclassification of Russian particles that implements Zwicky’s directive. Russian particles lack a coherent conceptual basis as a category and many are ambiguous with respect to part of speech. Our corpus analysis of Ru...
Replication data for: Spøkelsesfiske, makrellfotball og traktoregg: norske sammensetninger og konseptuell integrasjon Apr 5, 2016 Nesset, Tore, 2016, "Replication data for: Spøkelsesfiske, makrellfotball og traktoregg: norske sammensetninger og konseptuell integrasjon", https://doi.org/10.18710/R4E1EW, DataverseNO, V1 This database is part of a study of Norwegian compounds from the perspective of cognitive linguistics and conceptual integration published in the Norwegian journal Maal og Minne. The database contains a large number of compounds based on the word fiske ‘fishing’. Here is some information about the study from the article: "Denne artikkelen presenter...
Multi-Dimensional Analysis of Czech Oct 30, 2018 Cvrček, Václav, 2018, "Multi-Dimensional Analysis of Czech", https://doi.org/10.18710/QAJKZW, DataverseNO, V1, UNF:6:5rqhrfGF8iJspOAQER3OCA== [fileUNF] Original data for a general-purpose multi-dimensional analysis model of register variation in Czech. This post contains a CSV data set of 137 linguistic features measured on 3428 Czech text chunks, and an R script which performs a factor analysis on this data set. The results of this factor analysis were used as a basis for an 8-dimensional model o...
Replication data for: Constructions are not predictable but are motivated: evidence from the Spanish completive reflexive Mar 11, 2024 Lewandowski, Wojciech, 2018, "Replication data for: Constructions are not predictable but are motivated: evidence from the Spanish completive reflexive", https://doi.org/10.18710/4QHOBK, DataverseNO, V2, UNF:6:tokJXAhE3MEy0uanXSF5aQ== [fileUNF] Many researchers seem to think that construction grammar posits the existence of just wholly idiosyncratic constructions or form-meaning pairings. However, this idea demonstrates a deep misunderstanding of the approach, since constructions rarely emerge sui generis. Rather, construction grammar aims to balance the fact that some linguistic uses can...

Replication Data for: Los vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual

May 31, 2023

De Latte, Fien, 2023, "Replication Data for: Los vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual", https://doi.org/10.18710/CMOIVT, DataverseNO, V1

Dataset abstract: The two data files in this dataset contain the annotated data used to conduct the Apparent-Time and micro-diachronic analysis presented in the paper "Vocativos contraculturales: cambios paradigmáticos y difusión hasta el español coloquial actual". The first data file contains 1107 tokens of a carefully selected set of vocatives, i...

Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’

Nov 22, 2023

Zhamaletdinova, Elmira, 2023, "Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’", https://doi.org/10.18710/MOJBDK, DataverseNO, V1

Dataset description: This is a study of examples of Russian impersonal constructions with the modal word možno ‘can, be possible’ with and without the future copula budet ‘will be,’ i.e., možno + budet + INF and možno + INF. The data was collected in 2020-2021 from the old version of the Russian National Corpus (ruscorpora.ru). In the spreadsheet 0...

Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling from hierarchically structured corpus data

Oct 24, 2023

Sönning, Lukas, 2023, "Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling from hierarchically structured corpus data", https://doi.org/10.18710/5KCE4U, DataverseNO, V1

Dataset description This dataset, which is adapted from Jenset and McGillivray (2017), contains tabular files documenting the alternating usage of -(e)th and -(e)s to mark third-person verb inflection in Early Modern English. The data provided by Jenset and McGillivray (2017) are drawn from the PPCEME corpus (Kroch et al. 2004) and cover the period...

Replication Data for: The many guises of productivity: a case-study of Spanish inchoative constructions

Sep 13, 2024

Van Hulle, Sven, 2024, "Replication Data for: The many guises of productivity: a case-study of Spanish inchoative constructions", https://doi.org/10.18710/5E8I0T, DataverseNO, V1

The dataset contains the quantitative data used as input for the Principal Components Analysis conducted in the article "The many guises of productivity: a case-study of Spanish inchoative constructions". The data originates from the Spanish Web Corpus (esTenTen18), accessed via Sketch Engine (Kilgariff & Renau 2013). Only the subcorpus for Europea...

Supporting Data for: Enhancing code-switching research through comparable corpora: Introducing the El Paso Bilingual Corpus

Jun 18, 2025

Vanhaverbeke, Margot; Enghels, Renata; Parafita Couto, M. Carmen; Ivanova, Iva, 2025, "Supporting Data for: Enhancing code-switching research through comparable corpora: Introducing the El Paso Bilingual Corpus", https://doi.org/10.18710/7LGSXY, DataverseNO, V1

Dataset description: This dataset contains two data files that the related publication is based on. In particular, the data file Dataset_Diminutives contains in total 1886 diminutive constructions extracted from the Bangor Miami Corpus and the El Paso Bilingual Corpus. These constructions are coded for intralinguistic variables relating to the ling...

Romanian Weak Pronoun Choice Data

Jun 18, 2014

Gerstenberger, Ciprian-Virgil, 2014, "Romanian Weak Pronoun Choice Data", https://doi.org/10.18710/GSV27M, DataverseNO, V1

The following corpus study shows that soft linguistic constraints are hard to describe and operationalize. In specific contexts, some Romanian clitic pronouns allow a choice between phonological hosts such as in că-mi dai cartea vs. că îmi dai cartea both meaning [that you give me the book]. What determines the choice between subjunction că in că-m...

Replication data for: Who needs particles? A challenge to the classification of particles as a part of speech in Russian

Mar 29, 2016

Endresen, Anna; Janda, Laura A.; Reynolds, Robert; Tyers, Francis M., 2016, "Replication data for: Who needs particles? A challenge to the classification of particles as a part of speech in Russian", https://doi.org/10.18710/700FNV, DataverseNO, V1

In 1985, Zwicky argued that “particle” is a pretheoretical notion that should be eliminated from linguistic analysis. We propose a reclassification of Russian particles that implements Zwicky’s directive. Russian particles lack a coherent conceptual basis as a category and many are ambiguous with respect to part of speech. Our corpus analysis of Ru...

Replication data for: Spøkelsesfiske, makrellfotball og traktoregg: norske sammensetninger og konseptuell integrasjon

Apr 5, 2016

Nesset, Tore, 2016, "Replication data for: Spøkelsesfiske, makrellfotball og traktoregg: norske sammensetninger og konseptuell integrasjon", https://doi.org/10.18710/R4E1EW, DataverseNO, V1

This database is part of a study of Norwegian compounds from the perspective of cognitive linguistics and conceptual integration published in the Norwegian journal Maal og Minne. The database contains a large number of compounds based on the word fiske ‘fishing’. Here is some information about the study from the article: "Denne artikkelen presenter...

Multi-Dimensional Analysis of Czech

Oct 30, 2018

Cvrček, Václav, 2018, "Multi-Dimensional Analysis of Czech", https://doi.org/10.18710/QAJKZW, DataverseNO, V1, UNF:6:5rqhrfGF8iJspOAQER3OCA== [fileUNF]

Original data for a general-purpose multi-dimensional analysis model of register variation in Czech. This post contains a CSV data set of 137 linguistic features measured on 3428 Czech text chunks, and an R script which performs a factor analysis on this data set. The results of this factor analysis were used as a basis for an 8-dimensional model o...

Replication data for: Constructions are not predictable but are motivated: evidence from the Spanish completive reflexive

Mar 11, 2024

Lewandowski, Wojciech, 2018, "Replication data for: Constructions are not predictable but are motivated: evidence from the Spanish completive reflexive", https://doi.org/10.18710/4QHOBK, DataverseNO, V2, UNF:6:tokJXAhE3MEy0uanXSF5aQ== [fileUNF]

Many researchers seem to think that construction grammar posits the existence of just wholly idiosyncratic constructions or form-meaning pairings. However, this idea demonstrates a deep misunderstanding of the approach, since constructions rarely emerge sui generis. Rather, construction grammar aims to balance the fact that some linguistic uses can...

Add Data

Share Dataverse

Link Dataverse

Reset Modifications