Part 1: Document Description
|
|
Citation |
|
|---|---|
|
Title: |
Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic |
|
Identification Number: |
doi:10.18710/JXMG1M |
|
Distributor: |
DataverseNO |
|
Date of Distribution: |
2025-09-01 |
|
Version: |
1 |
|
Bibliographic Citation: |
Somers, Joren; Barðdal, Jóhanna, 2025, "Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic", https://doi.org/10.18710/JXMG1M, DataverseNO, V1 |
|
Citation |
|
|
Title: |
Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic |
|
Identification Number: |
doi:10.18710/JXMG1M |
|
Authoring Entity: |
Somers, Joren (University of Texas at Austin) |
|
Barðdal, Jóhanna (Ghent University) |
|
|
Other identifications and acknowledgements: |
Joren Somers |
|
Other identifications and acknowledgements: |
Jóhanna Barðdal |
|
Producer: |
Ghent University |
|
Grant Number: |
BOF-GOA grant nr. 01G01319 |
|
Distributor: |
DataverseNO |
|
Distributor: |
The Tromsø Repository of Language and Linguistics (TROLLing) |
|
Access Authority: |
Somers, Joren |
|
Depositor: |
Somers, Joren |
|
Date of Deposit: |
2025-06-06 |
|
Holdings Information: |
https://doi.org/10.18710/JXMG1M |
|
Study Scope |
|
|
Keywords: |
Arts and Humanities, Icelandic, Corpus research, Argument structure, Dative subjects, Word order, Dat-Nom/Nom-Dat verbs, Dat-Nom verbs, Nom-Dat verbs, Pronouns vs. full NPs, Topicality |
|
Abstract: |
The current Icelandic dataset contains the replication data for Somers & Barðdal (2022) and Somers, Jenset & Barðdal (2024a). The data have also been used in Somers & Barðdal (2023), Somers, Jenset & Barðdal (2024b) and Elens, Somers & Barðdal (2024). The dataset, which has been compiled using the Icelandic Web 2020 corpus, or isTenTen20, contains 200 observations each for 15 Icelandic verbs, thus amounting to a total of 3,000 observations. The verbs in question belong to one of three syntactic classes: (1) Dat-Nom verbs, (2) Nom-Dat verbs and (3) Dat-Nom/Nom-Dat verbs. All tokens have been annotated for lemma, verb class and the internal order of the dative and the nominative arguments. In addition, each constituent, dative and nominative, has been annotated for (1) case, (2) (pro)nominality, (3) pronoun type (if applicable), (4) referentiality, (5) person, (6) number, (7) definiteness, (8) animacy and (9) length. |
|
Time Period: |
2020-08-01 to 2020-09-30 |
|
Date of Collection: |
2022-05-01 to 2022-09-30 |
|
Country: |
Iceland |
|
Kind of Data: |
Annotated corpus data |
|
Methodology and Processing |
|
|
Sources Statement |
|
|
Data Sources: |
The data in this dataset were retrieved from the isTenTen20 corpus, which was accessed through the SketchEngine interface (Jakubíček et al. 2013). For more information about the isTenTen20 corpus, see: <a href="https://www.sketchengine.eu/istenten-icelandic-corpus/" target="_blank">https://www.sketchengine.eu/istenten-icelandic-corpus/</a>. The individual sources have been attributed as far as possible. To retrieve an individual source, use the source website given in the csv file and search for the text found in the columns LeftCotext, RightCotext and ConcatenatedText. The extracted text fragments contained in the data file IcelandicAlternatingVerbs.csv of this dataset represent only non-substantial portions of the sources listed above, and they do not form coherent larger texts. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exception rules in IPR and database protection regulations, such as Fair use (USA; cf. <a href="https://www.copyright.gov/fair-use/more-info.html" title="Fair use" target="_blank">US Copyright Act</a>), Fair dealing (UK; cf. <a href="https://www.gov.uk/guidance/exceptions-to-copyright" title="Fair dealing" target="_blank">Exceptions to copyright</a>), the <a href="http://data.europa.eu/eli/dir/1996/9/2019-06-06" title="Lawful users" target="_blank">EU Database Directive</a> (cf. article 8 Rights and obligations of lawful users), "lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§14" title="offentlige vedtak" target="_blank">§ 14 in Åndsverkloven</a>), "uvesentlige deler av databaser" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§24" title="uvesentlige deler av databaser" target="_blank">§ 24 in Åndsverkloven</a>), "sitatretten" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§29" title="sitatretter" target="_blank">§ 29 in Åndsverkloven</a>).
As these excerpts do not represent substantial parts of the reused sources, Creative Commons (CC) also permits their redistribution if they are extracted from sources that are distributed under Creative Commons licenses (cf. question "Do I always have to comply with the license terms? If not, what are the exceptions?" in the <a href="https://creativecommons.org/faq/" title="CC FAQ" target="_blank">Creative Commons Frequently Asked Questions</a>). |
|
Data Access |
|
|
Notes: |
<p>With the exception of the information in columns B (LeftCotext), C (KeyWord), D (RightCotext) and G (ConcatenatedText) in IcelandicAlternatingVerbs.csv, the dataset "Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic" has been marked as dedicated to the public domain, as described here: <a href="https://creativecommons.org/publicdomain/zero/1.0/">https://creativecommons.org/publicdomain/zero/1.0/</a>.</p> <p>Our <a href="https://dataverse.org/best-practices/dataverse-community-norms">Community Norms</a> as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.</p> <p>Columns B–D and G in the tabular file IcelandicAlternatingVerbs.csv contain text fragments that have been extracted from the isTenTen20 corpus, available at <a href="https://www.sketchengine.eu/istenten-icelandic-corpus/">https://www.sketchengine.eu/istenten-icelandic-corpus/</a>, under limitations and exceptions to IPR and database protection regulations. The contribution of the author of the present dataset to these files, as detailed in the ReadMe file, is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, as described here: <a href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</a>.</p> |
|
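The column split described in the notes above (corpus text in LeftCotext, KeyWord, RightCotext and ConcatenatedText; annotations in all other columns) can be sketched in Python. This is only an illustration under stated assumptions: it assumes the file is comma-delimited with a header row, and the "Lemma", "VerbClass" and "Order" headers and all cell values in the sample are hypothetical placeholders, not the actual contents of IcelandicAlternatingVerbs.csv. Only the four text-column names are taken from the notes.

```python
import csv
import io

# Columns holding isTenTen20 text fragments (names taken from the notes above).
CORPUS_TEXT_COLUMNS = ("LeftCotext", "KeyWord", "RightCotext", "ConcatenatedText")


def split_by_licence(rows):
    """Split each row into an annotation-only dict and a corpus-text dict."""
    annotations, fragments = [], []
    for row in rows:
        annotations.append(
            {k: v for k, v in row.items() if k not in CORPUS_TEXT_COLUMNS}
        )
        fragments.append(
            {k: v for k, v in row.items() if k in CORPUS_TEXT_COLUMNS}
        )
    return annotations, fragments


# In-memory stand-in for the real file. The "Lemma", "VerbClass" and "Order"
# headers and every value below are hypothetical placeholders.
sample = io.StringIO(
    "Lemma,LeftCotext,KeyWord,RightCotext,VerbClass,Order,ConcatenatedText\n"
    "verb_a,left,key,right,class_x,order_y,left key right\n"
)

annotations, fragments = split_by_licence(csv.DictReader(sample))
print(annotations[0])  # annotation columns only
print(fragments[0])    # corpus text columns only
```

To apply the same split to the deposited file, the `io.StringIO` sample could be replaced by an opened file handle, for example `open("IcelandicAlternatingVerbs.csv", newline="", encoding="utf-8")`; the encoding and delimiter are assumptions and should be checked against the ReadMe file.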
Other Study Description Materials |
|
|
Related Publications |
|
|
Citation |
|
|
Title: |
Somers, Joren & Jóhanna Barðdal. 2022. Alternating Dat-Nom/Nom-Dat verbs in Icelandic: An exploratory corpus-based analysis. Working Papers in Scandinavian Syntax 107: 83–110. |
|
Bibliographic Citation: |
Somers, Joren & Jóhanna Barðdal. 2022. Alternating Dat-Nom/Nom-Dat verbs in Icelandic: An exploratory corpus-based analysis. Working Papers in Scandinavian Syntax 107: 83–110. |
|
Citation |
|
|
Title: |
Somers, Joren & Jóhanna Barðdal. 2023. Comparing the argument structure of alternating Dat-Nom/Nom-Dat predicates in German and Icelandic. Working Papers in Scandinavian Syntax 108: 1–25. |
|
Bibliographic Citation: |
Somers, Joren & Jóhanna Barðdal. 2023. Comparing the argument structure of alternating Dat-Nom/Nom-Dat predicates in German and Icelandic. Working Papers in Scandinavian Syntax 108: 1–25. |
|
Citation |
|
|
Title: |
Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024a. Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic. Nordic Journal of Linguistics (online first article). |
|
Identification Number: |
10.1017/S0332586524000039 |
|
Bibliographic Citation: |
Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024a. Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic. Nordic Journal of Linguistics (online first article). |
|
Citation |
|
|
Title: |
Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024b. Subjecthood and argument structure of synonymous Dat-Nom/Nom-Dat verbs across German and Icelandic. Lingua 312, 103833. |
|
Identification Number: |
10.1016/j.lingua.2024.103833 |
|
Bibliographic Citation: |
Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024b. Subjecthood and argument structure of synonymous Dat-Nom/Nom-Dat verbs across German and Icelandic. Lingua 312, 103833. |
|
Label: |
0_README_IcelandicAlternatingVerbs.txt |
|
Notes: |
text/plain |
|
Label: |
IcelandicAlternatingVerbs.csv |
|
Notes: |
text/comma-separated-values |