Persistent Identifier
|
doi:10.18710/JXMG1M |
Publication Date
|
2025-09-01 |
Title
| Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic |
Author
| Somers, Joren (University of Texas at Austin) - ORCID: 0000-0002-3815-3139
Barðdal, Jóhanna (Ghent University) - ORCID: 0000-0003-0164-4249 |
Point of Contact
|
Somers, Joren |
Description
| The current Icelandic dataset contains the replication data for Somers & Barðdal (2022) and Somers, Jenset & Barðdal (2024a). The data have also been used in Somers & Barðdal (2023), Somers, Jenset & Barðdal (2024b) and Elens, Somers & Barðdal (2024). The dataset, which has been compiled using the Icelandic Web 2020 corpus, or isTenTen20, contains 200 observations each for 15 Icelandic verbs, thus amounting to a total of 3,000 observations. The verbs in question belong to one of three syntactic classes: (1) Dat-Nom verbs, (2) Nom-Dat verbs and (3) Dat-Nom/Nom-Dat verbs. All tokens have been annotated for lemma, verb class and the internal order of the dative and the nominative arguments. In addition, each constituent, dative and nominative, has been annotated for (1) case, (2) (pro)nominality, (3) pronoun type (if applicable), (4) referentiality, (5) person, (6) number, (7) definiteness, (8) animacy and (9) length. (2025-06-06) |
Subject
| Arts and Humanities |
Keyword
| Icelandic
Corpus research
Argument structure
Dative subjects
Word order
Dat-Nom/Nom-Dat verbs
Dat-Nom verbs
Nom-Dat verbs
Pronouns vs. full NPs
Topicality |
Related Publication
| Somers, Joren & Jóhanna Barðdal. 2022. Alternating Dat-Nom/Nom-Dat verbs in Icelandic: An exploratory corpus-based analysis. Working Papers in Scandinavian Syntax 107: 83–110. url: https://projekt.ht.lu.se/fileadmin/user_upload/sol/ovrigt/projekt_grimm/working_papers/2022-dec/Somers-Barddal.pdf
Somers, Joren & Jóhanna Barðdal. 2023. Comparing the argument structure of alternating Dat-Nom/Nom-Dat predicates in German and Icelandic. Working Papers in Scandinavian Syntax 108: 1–25. url: https://projekt.ht.lu.se/fileadmin/user_upload/sol/ovrigt/projekt_grimm/working_papers/2023-jun/Somers_Barddal.WPSS.108.juli.pdf
Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024a. Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic. Nordic Journal of Linguistics (online first article). doi: https://doi.org/10.1017/S0332586524000039
Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024b. Subjecthood and argument structure of synonymous Dat-Nom/Nom-Dat verbs across German and Icelandic. Lingua 312, 103833. doi: https://doi.org/10.1016/j.lingua.2024.103833 |
Language
| English |
Producer
| Ghent University https://www.ugent.be/en |
Contributor
| Data Collector : Joren Somers
Data Manager : Joren Somers
Project Member : Joren Somers
Data Collector : Jóhanna Barðdal
Project Leader : Jóhanna Barðdal
Supervisor : Jóhanna Barðdal |
Funding Information
| Concerted Research Action Scheme of Ghent University's Special Research Fund: BOF-GOA grant no. 01G01319 |
Distributor
| The Tromsø Repository of Language and Linguistics (TROLLing) https://trolling.uit.no/ |
Depositor
| Somers, Joren |
Deposit Date
| 2025-06-06 |
Time Period
| Start Date: 2020-08-01 ; End Date: 2020-09-30 |
Date of Collection
| Start Date: 2022-05-01 ; End Date: 2022-09-30 |
Data Type
| Annotated corpus data |
Data Source
| The data in this dataset were retrieved from the isTenTen20 corpus, which was accessed through the SketchEngine interface (Jakubíček et al. 2013). For more information about the isTenTen20 corpus, see: https://www.sketchengine.eu/istenten-icelandic-corpus/. Individual sources have been attributed to the best extent possible. To retrieve an individual source, go to the source website given in the csv file and search for the text found in the columns LeftCotext, RightCotext and ConcatenatedText. The extracted text fragments contained in the data file IcelandicAlternatingVerbs.csv of this dataset represent only non-substantial portions of their sources, and they do not form coherent larger texts. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exception rules in IPR and database protection regulations, such as Fair use (USA; cf. US Copyright Act), Fair dealing (UK; cf. Exceptions to copyright), the EU Database Directive (cf. article 8 Rights and obligations of lawful users), "lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet" (laws, regulations, court decisions and other decisions by public authorities; Norway; cf. § 14 in Åndsverkloven), "uvesentlige deler av databaser" (insubstantial parts of databases; Norway; cf. § 24 in Åndsverkloven) and "sitatretten" (the right of quotation; Norway; cf. § 29 in Åndsverkloven). Because these excerpts do not represent substantial parts of the reused sources, their redistribution is, according to Creative Commons (CC), also permitted if they are extracted from sources that are distributed under Creative Commons licenses (cf. question "Do I always have to comply with the license terms? If not, what are the exceptions?" in the Creative Commons Frequently Asked Questions). |