Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic (doi:10.18710/JXMG1M)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link) (external link) (external link) (external link)

Document Description

Citation

Title:

Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic

Identification Number:

doi:10.18710/JXMG1M

Distributor:

DataverseNO

Date of Distribution:

2025-09-01

Version:

1

Bibliographic Citation:

Somers, Joren; Barðdal, Jóhanna, 2025, "Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic", https://doi.org/10.18710/JXMG1M, DataverseNO, V1

Study Description

Citation

Title:

Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic

Identification Number:

doi:10.18710/JXMG1M

Authoring Entity:

Somers, Joren (University of Texas at Austin)

Barðdal, Jóhanna (Ghent University)

Other identifications and acknowledgements:

Joren Somers

Other identifications and acknowledgements:

Joren Somers

Other identifications and acknowledgements:

Joren Somers

Other identifications and acknowledgements:

Jóhanna Barðdal

Other identifications and acknowledgements:

Jóhanna Barðdal

Other identifications and acknowledgements:

Jóhanna Barðdal

Producer:

Ghent University

Grant Number:

BOF-GOA grant nr. 01G01319

Distributor:

DataverseNO

Distributor:

The Tromsø Repository of Language and Linguistics (TROLLing)

Access Authority:

Somers, Joren

Depositor:

Somers, Joren

Date of Deposit:

2025-06-06

Holdings Information:

https://doi.org/10.18710/JXMG1M

Study Scope

Keywords:

Arts and Humanities, Icelandic, Corpus research, Argument structure, Dative subjects, Word order, Dat-Nom/Nom-Dat verbs, Dat-Nom verbs, Nom-Dat verbs, Pronouns vs. full NPs, Topicality

Abstract:

The current Icelandic dataset contains the replication data for Somers & Barðdal (2022) and Somers, Jenset & Barðdal (2024a). The data have also been used in Somers & Barðdal (2023), Somers, Jenset & Barðdal (2024b) and Elens, Somers & Barðdal (2024). The dataset, which has been compiled using the Icelandic Web 2020 corpus, or isTenTen20, contains 200 observations each for 15 Icelandic verbs, thus amounting to a total of 3,000 observations. The verbs in question belong to one of three syntactic classes: (1) Dat-Nom verbs, (2) Nom-Dat verbs and (3) Dat-Nom/Nom-Dat verbs. All tokens have been annotated for lemma, verb class and the internal order of the dative and the nominative arguments. In addition, each constituent, dative and nominative, has been annotated for (1) case, (2) (pro)nominality, (3) pronoun type (if applicable), (4) referentiality, (5) person, (6) number, (7) definiteness, (8) animacy and (9) length.

Time Period:

2020-08-01-2020-09-30

Date of Collection:

2022-05-01-2022-09-30

Country:

Iceland

Kind of Data:

Annotated corpus data

Methodology and Processing

Sources Statement

Data Sources:

The data in this dataset were retrieved from the isTenTen20 corpus, which was accessed through the SketchEngine interface (Jakubíček et al. 2013). For more information about the isTenTen20 corpus, see: <a href="https://www.sketchengine.eu/istenten-icelandic-corpus/" target="_blank">https://www.sketchengine.eu/istenten-icelandic-corpus/</a>. Attributing the individual sources has been done to the best extent. However, to retrieve individual sources, use the source website in the csv file and look for the text found in columns LeftCotext, RightCotext, ConcatenatedText. The extracted text fragments that are contained in the data file IcelandicAlternatingVerbs.csv of this dataset only represent non-substantial portions of the sources listed above, and they do not represent coherent larger texts. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exceptions rules in IPR and database protection regulations, such as Fair use (USA cf. <a href="https://www.copyright.gov/fair-use/more-info.html" title="Fair use" target="_blank">US Copyright Act</a>), Fair dealing (UK; cf. <a href="https://www.gov.uk/guidance/exceptions-to-copyright" title="Fair dealing" target="_blank">Exceptions to copyright</a>), the <a href="http://data.europa.eu/eli/dir/1996/9/2019-06-06" title="Lawful users" target="_blank">EU Database Directive</a> (cf. article 8 Rights and obligations of lawful users), "lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§14" title="offentlige vedtak" target="_blank">§ 14 in Åndsverkloven</a>), "uvesentlige deler av databaser" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§24" title="uvesentlige deler av databaser" target="_blank">§ 24 in Åndsverkloven</a>), "sitatretten" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§29" title="sitatretter" target="_blank">§ 29 in Åndsverkloven</a>). As these excerpts do not represent substantial parts of the reused sources, the redistribution of these excerpts is according to Creative Commons (CC) also permitted if they are extracted from sources that are distributed under Creative Commons licenses (cf. question "Do I always have to comply with the license terms? If not, what are the exceptions?" in the <a href="https://creativecommons.org/faq/" title="CC FAQ" target="_blank">Creative Commons Frequently Asked Questions</a>).

Data Access

Notes:

<p>With the exception of the information in columns B (LeftCotext), C (KeyWord), D (RightCotext) and G (ConcatenatedText) in IcelandicAlternatingVerbs.csv, the dataset "Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic" has been marked as dedicated to the public domain, as described here: <a href="https://creativecommons.org/publicdomain/zero/1.0/">https://creativecommons.org/publicdomain/zero/1.0/</a>.</p> <p>Our <a href="https://dataverse.org/best-practices/dataverse-community-norms">Community Norms</a> as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.</p> <p>Columns B–D and G in the tabular file IcelandicAlternatingVerbs.csv, contain text fragments that have been extracted from the isTenTen20 corpus, available at <a href=" https://www.sketchengine.eu/istenten-icelandic-corpus/"> https://www.sketchengine.eu/istenten-icelandic-corpus/ </a>, under limitations and exceptions to IPR and database protection regulations. The contribution of the author of the present dataset to these files, as detailed in the ReadMe file, is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, as described here: <a href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</a>.</p>

Other Study Description Materials

Related Publications

Citation

Title:

Somers, Joren & Jóhanna Barðdal. 2022. Alternating Dat-Nom/Nom-Dat verbs in Icelandic: An exploratory corpus-based analysis. Working Papers in Scandinavian Syntax 107: 83–110.

Bibliographic Citation:

Somers, Joren & Jóhanna Barðdal. 2022. Alternating Dat-Nom/Nom-Dat verbs in Icelandic: An exploratory corpus-based analysis. Working Papers in Scandinavian Syntax 107: 83–110.

Citation

Title:

Somers, Joren & Jóhanna Barðdal. 2023. Comparing the argument structure of alternating Dat-Nom/Nom-Dat predicates in German and Icelandic. Working Papers in Scandinavian Syntax 108: 1–25.

Bibliographic Citation:

Somers, Joren & Jóhanna Barðdal. 2023. Comparing the argument structure of alternating Dat-Nom/Nom-Dat predicates in German and Icelandic. Working Papers in Scandinavian Syntax 108: 1–25.

Citation

Title:

Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024a. Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic. Nordic Journal of Linguistics (online first article).

Identification Number:

10.1017/S0332586524000039

Bibliographic Citation:

Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024a. Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic. Nordic Journal of Linguistics (online first article).

Citation

Title:

Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024b. Subjecthood and argument structure of synonymous Dat-Nom/Nom-Dat verbs across German and Icelandic. Lingua 312, 103833.

Identification Number:

10.1016/j.lingua.2024.103833

Bibliographic Citation:

Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024b. Subjecthood and argument structure of synonymous Dat-Nom/Nom-Dat verbs across German and Icelandic. Lingua 312, 103833.

Other Study-Related Materials

Label:

0_README_IcelandicAlternatingVerbs.txt

Notes:

text/plain

Other Study-Related Materials

Label:

IcelandicAlternatingVerbs.csv

Notes:

text/comma-separated-values