{"id":249618,"identifier":"JXMG1M","persistentUrl":"https://doi.org/10.18710/JXMG1M","protocol":"doi","authority":"10.18710","separator":"/","publisher":"DataverseNO","publicationDate":"2025-09-01","storageIdentifier":"S3://10.18710/JXMG1M","datasetType":"dataset","datasetVersion":{"id":4728,"datasetId":249618,"datasetPersistentId":"doi:10.18710/JXMG1M","storageIdentifier":"S3://10.18710/JXMG1M","versionNumber":1,"versionMinorNumber":0,"versionState":"RELEASED","latestVersionPublishingState":"RELEASED","deaccessionLink":"","lastUpdateTime":"2025-09-01T12:58:08Z","releaseTime":"2025-09-01T12:58:08Z","createTime":"2025-06-06T18:31:25Z","publicationDate":"2025-09-01","citationDate":"2025-09-01","termsOfUse":"<p>With the exception of the information in columns B (LeftCotext), C (KeyWord), D (RightCotext) and G (ConcatenatedText) in IcelandicAlternatingVerbs.csv, the dataset \"Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic\" has been marked as dedicated to the public domain, as described here: <a href=\"https://creativecommons.org/publicdomain/zero/1.0/\">https://creativecommons.org/publicdomain/zero/1.0/</a>.</p> \n\n<p>Our <a href=\"https://dataverse.org/best-practices/dataverse-community-norms\">Community Norms</a> as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.</p> \n\n<p>Columns B–D and G in the tabular file IcelandicAlternatingVerbs.csv, contain text fragments that have been extracted from the isTenTen20 corpus, available at <a href=\" https://www.sketchengine.eu/istenten-icelandic-corpus/\"> https://www.sketchengine.eu/istenten-icelandic-corpus/ </a>, under limitations and exceptions to IPR and database protection regulations. The contribution of the author of the present dataset to these files, as detailed in the ReadMe file, is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, as described here: <a href=\"https://creativecommons.org/licenses/by/4.0/\">https://creativecommons.org/licenses/by/4.0/</a>.</p> ","fileAccessRequest":true,"metadataBlocks":{"citation":{"displayName":"Citation Metadata","name":"citation","fields":[{"typeName":"title","multiple":false,"typeClass":"primitive","value":"Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic"},{"typeName":"author","multiple":true,"typeClass":"compound","value":[{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Somers, Joren"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"University of Texas at Austin"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"0000-0002-3815-3139"}},{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Barðdal, Jóhanna"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"Ghent University"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"0000-0003-0164-4249"}}]},{"typeName":"datasetContact","multiple":true,"typeClass":"compound","value":[{"datasetContactName":{"typeName":"datasetContactName","multiple":false,"typeClass":"primitive","value":"Somers, Joren"},"datasetContactEmail":{"typeName":"datasetContactEmail","multiple":false,"typeClass":"primitive","value":"joren.somers@ugent.be"}}]},{"typeName":"dsDescription","multiple":true,"typeClass":"compound","value":[{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"The current Icelandic dataset contains the replication data for Somers & Barðdal (2022) and Somers, Jenset & Barðdal (2024a). The data have also been used in Somers & Barðdal (2023), Somers, Jenset & Barðdal (2024b) and Elens, Somers & Barðdal (2024). The dataset, which has been compiled using the Icelandic Web 2020 corpus, or isTenTen20, contains 200 observations each for 15 Icelandic verbs, thus amounting to a total of 3,000 observations. The verbs in question belong to one of three syntactic classes: (1) Dat-Nom verbs, (2) Nom-Dat verbs and (3) Dat-Nom/Nom-Dat verbs. All tokens have been annotated for lemma, verb class and the internal order of the dative and the nominative arguments. In addition, each constituent, dative and nominative, has been annotated for (1) case, (2) (pro)nominality, (3) pronoun type (if applicable), (4) referentiality, (5) person, (6) number, (7) definiteness, (8) animacy and (9) length."},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2025-06-06"}}]},{"typeName":"subject","multiple":true,"typeClass":"controlledVocabulary","value":["Arts and Humanities"]},{"typeName":"keyword","multiple":true,"typeClass":"compound","value":[{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Icelandic"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Corpus research"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Argument structure"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Dative subjects"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Word order"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Dat-Nom/Nom-Dat verbs"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Dat-Nom verbs"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Nom-Dat verbs"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Pronouns vs. full NPs"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Topicality"}}]},{"typeName":"publication","multiple":true,"typeClass":"compound","value":[{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Somers, Joren & Jóhanna Barðdal. 2022. Alternating Dat-Nom/Nom-Dat verbs in Icelandic: An exploratory corpus-based analysis. Working Papers in Scandinavian Syntax 107: 83–110."},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"url"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://projekt.ht.lu.se/fileadmin/user_upload/sol/ovrigt/projekt_grimm/working_papers/2022-dec/Somers-Barddal.pdf"}},{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Somers, Joren & Jóhanna Barðdal. 2023. Comparing the argument structure of alternating Dat-Nom/Nom-Dat predicates in German and Icelandic. Working Papers in Scandinavian Syntax 108: 1–25."},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"url"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://projekt.ht.lu.se/fileadmin/user_upload/sol/ovrigt/projekt_grimm/working_papers/2023-jun/Somers_Barddal.WPSS.108.juli.pdf"}},{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024a. Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic. Nordic Journal of Linguistics (online first article)."},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"doi"},"publicationIDNumber":{"typeName":"publicationIDNumber","multiple":false,"typeClass":"primitive","value":"10.1017/S0332586524000039"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://doi.org/10.1017/S0332586524000039"}},{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Somers, Joren, Gard B. Jenset & Jóhanna Barðdal. 2024b. Subjecthood and argument structure of synonymous Dat-Nom/Nom-Dat verbs across German and Icelandic. Lingua 312, 103833."},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"doi"},"publicationIDNumber":{"typeName":"publicationIDNumber","multiple":false,"typeClass":"primitive","value":"10.1016/j.lingua.2024.103833"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://doi.org/10.1016/j.lingua.2024.103833"}}]},{"typeName":"language","multiple":true,"typeClass":"controlledVocabulary","value":["English"]},{"typeName":"producer","multiple":true,"typeClass":"compound","value":[{"producerName":{"typeName":"producerName","multiple":false,"typeClass":"primitive","value":"Ghent University"},"producerURL":{"typeName":"producerURL","multiple":false,"typeClass":"primitive","value":"https://www.ugent.be/en"}}]},{"typeName":"contributor","multiple":true,"typeClass":"compound","value":[{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Data Collector"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Joren Somers"}},{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Data Manager"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Joren Somers"}},{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Project Member"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Joren Somers"}},{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Data Collector"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Jóhanna Barðdal"}},{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Project Leader"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Jóhanna Barðdal"}},{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Supervisor"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Jóhanna Barðdal"}}]},{"typeName":"grantNumber","multiple":true,"typeClass":"compound","value":[{"grantNumberAgency":{"typeName":"grantNumberAgency","multiple":false,"typeClass":"primitive","value":"Ghent University’s Special Research Fund's Concerted Research Action Scheme"},"grantNumberValue":{"typeName":"grantNumberValue","multiple":false,"typeClass":"primitive","value":"BOF-GOA grant nr. 01G01319"}}]},{"typeName":"distributor","multiple":true,"typeClass":"compound","value":[{"distributorName":{"typeName":"distributorName","multiple":false,"typeClass":"primitive","value":"The Tromsø Repository of Language and Linguistics (TROLLing)"},"distributorAbbreviation":{"typeName":"distributorAbbreviation","multiple":false,"typeClass":"primitive","value":"TROLLing"},"distributorURL":{"typeName":"distributorURL","multiple":false,"typeClass":"primitive","value":"https://trolling.uit.no/"}}]},{"typeName":"depositor","multiple":false,"typeClass":"primitive","value":"Somers, Joren"},{"typeName":"dateOfDeposit","multiple":false,"typeClass":"primitive","value":"2025-06-06"},{"typeName":"timePeriodCovered","multiple":true,"typeClass":"compound","value":[{"timePeriodCoveredStart":{"typeName":"timePeriodCoveredStart","multiple":false,"typeClass":"primitive","value":"2020-08-01"},"timePeriodCoveredEnd":{"typeName":"timePeriodCoveredEnd","multiple":false,"typeClass":"primitive","value":"2020-09-30"}}]},{"typeName":"dateOfCollection","multiple":true,"typeClass":"compound","value":[{"dateOfCollectionStart":{"typeName":"dateOfCollectionStart","multiple":false,"typeClass":"primitive","value":"2022-05-01"},"dateOfCollectionEnd":{"typeName":"dateOfCollectionEnd","multiple":false,"typeClass":"primitive","value":"2022-09-30"}}]},{"typeName":"kindOfData","multiple":true,"typeClass":"primitive","value":["Annotated corpus data"]},{"typeName":"dataSources","multiple":true,"typeClass":"primitive","value":["The data in this dataset were retrieved from the isTenTen20 corpus, which was accessed through the SketchEngine interface (Jakubíček et al. 2013). For more information about the isTenTen20 corpus, see: <a href=\"https://www.sketchengine.eu/istenten-icelandic-corpus/\" target=\"_blank\">https://www.sketchengine.eu/istenten-icelandic-corpus/</a>. Attributing the individual sources has been done to the best extent. However, to retrieve individual sources, use the source website in the csv file and look for the text found in columns LeftCotext, RightCotext, ConcatenatedText.\nThe extracted text fragments that are contained in the data file IcelandicAlternatingVerbs.csv of this dataset only represent non-substantial portions of the sources listed above, and they do not represent coherent larger texts. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exceptions rules in IPR and database protection regulations, such as Fair use (USA cf. <a href=\"https://www.copyright.gov/fair-use/more-info.html\" title=\"Fair use\" target=\"_blank\">US Copyright Act</a>), Fair dealing (UK; cf. <a href=\"https://www.gov.uk/guidance/exceptions-to-copyright\" title=\"Fair dealing\" target=\"_blank\">Exceptions to copyright</a>), the <a href=\"http://data.europa.eu/eli/dir/1996/9/2019-06-06\" title=\"Lawful users\" target=\"_blank\">EU Database Directive</a> (cf. article 8 Rights and obligations of lawful users), \"lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet\" (Norway; cf. <a href=\"https://lovdata.no/lov/2018-06-15-40/§14\" title=\"offentlige vedtak\" target=\"_blank\">§ 14 in Åndsverkloven</a>), \"uvesentlige deler av databaser\" (Norway; cf. <a href=\"https://lovdata.no/lov/2018-06-15-40/§24\" title=\"uvesentlige deler av databaser\" target=\"_blank\">§ 24 in Åndsverkloven</a>), \"sitatretten\" (Norway; cf. <a href=\"https://lovdata.no/lov/2018-06-15-40/§29\" title=\"sitatretter\" target=\"_blank\">§ 29 in Åndsverkloven</a>). As these excerpts do not represent substantial parts of the reused sources, the redistribution of these excerpts is according to Creative Commons (CC) also permitted if they are extracted from sources that are distributed under Creative Commons licenses (cf. question \"Do I always have to comply with the license terms? If not, what are the exceptions?\" in the <a href=\"https://creativecommons.org/faq/\" title=\"CC FAQ\" target=\"_blank\">Creative Commons Frequently Asked Questions</a>)."]}]},"geospatial":{"displayName":"Geospatial Metadata","name":"geospatial","fields":[{"typeName":"geographicCoverage","multiple":true,"typeClass":"compound","value":[{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"Iceland"}}]}]}},"files":[{"label":"0_README_IcelandicAlternatingVerbs.txt","restricted":false,"version":1,"datasetVersionId":4728,"dataFile":{"id":256132,"persistentId":"doi:10.18710/JXMG1M/K2UHFT","pidURL":"https://doi.org/10.18710/JXMG1M/K2UHFT","filename":"0_README_IcelandicAlternatingVerbs.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":17966,"storageIdentifier":"S3://uit-dataverseno-prod01:19905578bb7-40a627b09618","rootDataFileId":-1,"md5":"b8f6bec2220201af604278e1cab02c83","checksum":{"type":"MD5","value":"b8f6bec2220201af604278e1cab02c83"},"tabularData":false,"creationDate":"2025-09-01","publicationDate":"2025-09-01","fileAccessRequest":true}},{"label":"IcelandicAlternatingVerbs.csv","restricted":false,"version":1,"datasetVersionId":4728,"dataFile":{"id":254563,"persistentId":"doi:10.18710/JXMG1M/NXGYAN","pidURL":"https://doi.org/10.18710/JXMG1M/NXGYAN","filename":"IcelandicAlternatingVerbs.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2225004,"storageIdentifier":"S3://uit-dataverseno-prod01:197c2bfbc04-34f7d717378c","rootDataFileId":-1,"md5":"14f16dcabdfb652dbfc36f1ee681dbe7","checksum":{"type":"MD5","value":"14f16dcabdfb652dbfc36f1ee681dbe7"},"tabularData":false,"creationDate":"2025-06-30","publicationDate":"2025-09-01","fileAccessRequest":true}}],"citation":"Somers, Joren; Barðdal, Jóhanna, 2025, \"Replication Data for: Argument structure constructions in competition: The Dat-Nom/Nom-Dat alternation in Icelandic\", https://doi.org/10.18710/JXMG1M, DataverseNO, V1"}}