{"id":8451,"identifier":"FY7R8N","persistentUrl":"https://doi.org/10.18710/FY7R8N","protocol":"doi","authority":"10.18710","publisher":"DataverseNO","publicationDate":"2021-09-01","storageIdentifier":"S3://10.18710/FY7R8N","datasetVersion":{"id":2634,"datasetId":8451,"datasetPersistentId":"doi:10.18710/FY7R8N","storageIdentifier":"S3://10.18710/FY7R8N","versionNumber":1,"versionMinorNumber":2,"versionState":"RELEASED","lastUpdateTime":"2022-07-18T17:26:31Z","releaseTime":"2022-07-18T17:26:31Z","createTime":"2022-07-18T17:26:25Z","publicationDate":"2021-09-01","citationDate":"2021-09-01","termsOfUse":"
This dataset, \"Replication Data for: The history of Slavonic clausal complementation: a corpus view\", may be reused according to the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license as described here: https://creativecommons.org/licenses/by-nc-sa/4.0/.
\n\nThe raw data annotated and enriched in the tabular files in this dataset, \"Replication Data for: The history of Slavonic clausal complementation: a corpus view\", has been obtained from the following sources as described in the README file contained in this dataset:
\n\nThe PROIEL Treebank; available at https://proiel.github.io/; used under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0 International) license (https://creativecommons.org/licenses/by-nc-sa/4.0/).
\n\nThe TOROT Treebank; available at https://torottreebank.github.io/; used under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States (CC BY-NC-SA 3.0 US) license (https://creativecommons.org/licenses/by-nc-sa/3.0/us/).
\n\nAccording to Creative Commons (cf. Compatible Licenses), BY-SA licenses are compatible with, i.a., \"BY-SA 3.0, or a later version of the BY-SA license\".
","fileAccessRequest":true,"metadataBlocks":{"citation":{"displayName":"Citation Metadata","name":"citation","fields":[{"typeName":"title","multiple":false,"typeClass":"primitive","value":"Replication Data for: The history of Slavonic clausal complementation: a corpus view"},{"typeName":"author","multiple":true,"typeClass":"compound","value":[{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Eckhoff, Hanne"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"University of Oxford"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"0000-0001-8096-6515"}}]},{"typeName":"datasetContact","multiple":true,"typeClass":"compound","value":[{"datasetContactName":{"typeName":"datasetContactName","multiple":false,"typeClass":"primitive","value":"Eckhoff, Hanne"},"datasetContactAffiliation":{"typeName":"datasetContactAffiliation","multiple":false,"typeClass":"primitive","value":"University of Oxford"},"datasetContactEmail":{"typeName":"datasetContactEmail","multiple":false,"typeClass":"primitive","value":"hanneme@gmail.com"}}]},{"typeName":"dsDescription","multiple":true,"typeClass":"compound","value":[{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"This dataset provides replication data for an article on complementation structures in early Slavonic. Syntactic annotation of historical text, with no access to native-speaker intuitions, poses a number of problems to the annotator who is faced with the task of giving a single analysis of each sentence. The article reports on the experiences from annotating complementation structures in Old Church Slavonic and Old East Slavonic in the PROIEL and TOROT treebanks.
\n\nTwo case studies are examined: complement clauses in Old Church Slavonic and the history of Russian čьto ‘what, which, that’. In the first case the annotation scheme is shown to work well in terms of interannotator agreement and retrievability. However, the price is that a large number of examples with jako ‘that’ are analysed as complement clauses with a subjunction, even though many of these examples are in fact ambiguous and jako can equally well be interpreted as a quotative particle followed by direct speech.
\n\nThe second case study looks at a development from situation where čьto could be taken to be an interrogative pronoun in all subordinated clauses, to a situation where a subjunction and a relative pronoun analysis are also available. This leads to a large number of ambiguous occurrences. The solution in TOROT is to to analyse unambiguous interrogative pronoun and subjunction examples at face value, while all of the remaining occurrences are analysed as relative clauses. This makes the annotator's job manageable, but causes retrievability problems, since individual researchers will have to sift through the relative clause examples themselves.
"},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2021-08-18"}}]},{"typeName":"subject","multiple":true,"typeClass":"controlledVocabulary","value":["Arts and Humanities"]},{"typeName":"keyword","multiple":true,"typeClass":"compound","value":[{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"syntax"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"complementation"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Old Church Slavonic"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Old East Slavonic"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Middle Russian"}}]},{"typeName":"publication","multiple":true,"typeClass":"compound","value":[{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Hanne Eckhoff (2021): The history of Slavonic clausal complementation: a corpus view. In Björn Wiemer and Barbara Sonnenhauser: Clausal Complementation in South Slavic. Berlin: De Gruyter Mouton. https://doi.org/10.1515/9783110725858-008"},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"doi"},"publicationIDNumber":{"typeName":"publicationIDNumber","multiple":false,"typeClass":"primitive","value":"10.1515/9783110725858-008"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://doi.org/10.1515/9783110725858-008"}}]},{"typeName":"language","multiple":true,"typeClass":"controlledVocabulary","value":["English"]},{"typeName":"producer","multiple":true,"typeClass":"compound","value":[{"producerName":{"typeName":"producerName","multiple":false,"typeClass":"primitive","value":"University of Oxford"},"producerURL":{"typeName":"producerURL","multiple":false,"typeClass":"primitive","value":"https://www.ox.ac.uk/"}}]},{"typeName":"distributor","multiple":true,"typeClass":"compound","value":[{"distributorName":{"typeName":"distributorName","multiple":false,"typeClass":"primitive","value":"The Tromsø Repository of Language and Linguistics (TROLLing)"},"distributorAbbreviation":{"typeName":"distributorAbbreviation","multiple":false,"typeClass":"primitive","value":"TROLLing"},"distributorURL":{"typeName":"distributorURL","multiple":false,"typeClass":"primitive","value":"https://trolling.uit.no/"}}]},{"typeName":"depositor","multiple":false,"typeClass":"primitive","value":"Eckhoff, Hanne"},{"typeName":"dateOfDeposit","multiple":false,"typeClass":"primitive","value":"2019-08-27"},{"typeName":"timePeriodCovered","multiple":true,"typeClass":"compound","value":[{"timePeriodCoveredStart":{"typeName":"timePeriodCoveredStart","multiple":false,"typeClass":"primitive","value":"0863-01-01"},"timePeriodCoveredEnd":{"typeName":"timePeriodCoveredEnd","multiple":false,"typeClass":"primitive","value":"1675-12-31"}}]},{"typeName":"dateOfCollection","multiple":true,"typeClass":"compound","value":[{"dateOfCollectionStart":{"typeName":"dateOfCollectionStart","multiple":false,"typeClass":"primitive","value":"2017-03-26"},"dateOfCollectionEnd":{"typeName":"dateOfCollectionEnd","multiple":false,"typeClass":"primitive","value":"2017-04-05"}}]},{"typeName":"kindOfData","multiple":true,"typeClass":"primitive","value":["datasets with linguistic annotation","R script"]},{"typeName":"software","multiple":true,"typeClass":"compound","value":[{"softwareName":{"typeName":"softwareName","multiple":false,"typeClass":"primitive","value":"R"},"softwareVersion":{"typeName":"softwareVersion","multiple":false,"typeClass":"primitive","value":"4.0.3"}}]},{"typeName":"dataSources","multiple":true,"typeClass":"primitive","value":["The PROIEL Treebank. Available at https://proiel.github.io/.
\n\nDag T. T. Haug and Marius L. Jøhndal. 2008. 'Creating a Parallel Treebank of the Old Indo-European Bible Translations'. In Caroline Sporleder and Kiril Ribarov (eds.). Proceedings of the Second Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2008) (2008), pp. 27-34.
","The TOROT Treebank. Available at https://torottreebank.github.io/.
\n\nHanne Martine Eckhoff and Aleksandrs Berdicevskis. 2015. 'Linguistics vs. digital editions: The Tromsø Old Russian and OCS Treebank'. Scripta & e-Scripta 14–15, pp. 9-25. Open Access version availalbe at https://hdl.handle.net/10037/22366.
"]}]},"geospatial":{"displayName":"Geospatial Metadata","name":"geospatial","fields":[{"typeName":"geographicCoverage","multiple":true,"typeClass":"compound","value":[{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"Macedonia, the Former Yugoslav Republic of"}},{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"Bulgaria"}},{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"Ukraine"}},{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"Russian Federation"}}]}]}},"files":[{"description":"Additional dataset for correlative structures, to be processed by comps.r","label":"apos.csv","restricted":false,"version":1,"datasetVersionId":2634,"dataFile":{"id":104714,"persistentId":"doi:10.18710/FY7R8N/A9DHVK","pidURL":"https://doi.org/10.18710/FY7R8N/A9DHVK","filename":"apos.csv","contentType":"text/csv","filesize":159073,"description":"Additional dataset for correlative structures, to be processed by comps.r","storageIdentifier":"S3://2002-yellow-dataverseno:17b5927dc7f-d016e62eb19c","rootDataFileId":-1,"md5":"8039f3f1d9c8f61ef599b80bece5cd4b","checksum":{"type":"MD5","value":"8039f3f1d9c8f61ef599b80bece5cd4b"},"creationDate":"2021-08-18"}},{"description":"R script processing the main datasets","label":"comps.r","restricted":false,"version":1,"datasetVersionId":2634,"dataFile":{"id":104835,"persistentId":"doi:10.18710/FY7R8N/HJAGPU","pidURL":"https://doi.org/10.18710/FY7R8N/HJAGPU","filename":"comps.r","contentType":"type/x-r-syntax","filesize":4621,"description":"R script processing the main datasets","storageIdentifier":"S3://2002-yellow-dataverseno:17b96bb4b8e-bf4f1db4af76","rootDataFileId":-1,"md5":"2fc0aef629076d427024f676241ac23b","checksum":{"type":"MD5","value":"2fc0aef629076d427024f676241ac23b"},"creationDate":"2021-08-30"}},{"description":"Full data set to be read by comps.r","label":"comps260317.csv","restricted":false,"version":1,"datasetVersionId":2634,"dataFile":{"id":104739,"persistentId":"doi:10.18710/FY7R8N/P088NK","pidURL":"https://doi.org/10.18710/FY7R8N/P088NK","filename":"comps260317.csv","contentType":"text/csv","filesize":911740,"description":"Full data set to be read by comps.r","storageIdentifier":"S3://2002-yellow-dataverseno:17b67eecad5-22b89428555f","rootDataFileId":-1,"md5":"1ef143b1ff6958192ddebecad62b04e5","checksum":{"type":"MD5","value":"1ef143b1ff6958192ddebecad62b04e5"},"creationDate":"2021-08-21"}},{"description":"README with descriptions of all the other files","label":"readme.txt","restricted":false,"version":1,"datasetVersionId":2634,"dataFile":{"id":104882,"persistentId":"doi:10.18710/FY7R8N/VECP74","pidURL":"https://doi.org/10.18710/FY7R8N/VECP74","filename":"readme.txt","contentType":"text/plain","filesize":12620,"description":"README with descriptions of all the other files","storageIdentifier":"S3://2002-yellow-dataverseno:17b9b72c943-84f138cd8a7a","rootDataFileId":-1,"md5":"47e13c5b1034c980a1fd375104c91104","checksum":{"type":"MD5","value":"47e13c5b1034c980a1fd375104c91104"},"creationDate":"2021-08-31"}},{"description":"Dataset with annotation for conversion of person reference in jako-clauses","label":"thirdperson_tagged.csv","restricted":false,"version":1,"datasetVersionId":2634,"dataFile":{"id":104883,"persistentId":"doi:10.18710/FY7R8N/YAS9MY","pidURL":"https://doi.org/10.18710/FY7R8N/YAS9MY","filename":"thirdperson_tagged.csv","contentType":"text/csv","filesize":90223,"description":"Dataset with annotation for conversion of person reference in jako-clauses","storageIdentifier":"S3://2002-yellow-dataverseno:17b9b734907-3e46a8c40045","rootDataFileId":-1,"md5":"884186ae56ffb9144378c3bb34156a83","checksum":{"type":"MD5","value":"884186ae56ffb9144378c3bb34156a83"},"creationDate":"2021-08-31"}}],"citation":"Eckhoff, Hanne, 2021, \"Replication Data for: The history of Slavonic clausal complementation: a corpus view\", https://doi.org/10.18710/FY7R8N, DataverseNO, V1"}}