{"id":87505,"identifier":"A3SATC","persistentUrl":"https://doi.org/10.18710/A3SATC","protocol":"doi","authority":"10.18710","publisher":"DataverseNO","publicationDate":"2021-01-30","storageIdentifier":"S3://10.18710/A3SATC","datasetVersion":{"id":3763,"datasetId":87505,"datasetPersistentId":"doi:10.18710/A3SATC","storageIdentifier":"S3://10.18710/A3SATC","versionNumber":1,"versionMinorNumber":2,"versionState":"RELEASED","UNF":"UNF:6:S8E8T69mmiEVrGGC0XPIrg==","lastUpdateTime":"2023-09-28T19:49:22Z","releaseTime":"2023-09-28T19:49:22Z","createTime":"2023-09-28T15:41:48Z","publicationDate":"2021-01-30","citationDate":"2021-01-30","termsOfUse":"
This dataset, \"Actually in contemporary British speech: Data from the Spoken BNC corpora\", may be reused according to the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license as described here: https://creativecommons.org/licenses/by-nc/4.0/.
\r\n\r\nThe reuse of this present dataset is restricted to non-commercial because the data provider of the source used in the file \"actually_data_1994_position.csv\", i.e. the British National Corpus (BNC), does not permit commercial reuse of its data. The restrictions specified below apply only to the file \"actually_data_1994_position.csv\".
","restrictions":"In the file \"actually_data_1994_position.csv\", the contents of the columns \"left_context\" and \"right_context\" are extracts from the British National Corpus (XML edition; http://www.natcorp.ox.ac.uk/). The BNC User Licence (cf. http://www.natcorp.ox.ac.uk/docs/licence.html) states that the use of such extracts is only licensed under the fair dealings provision of UK Copyright Law (cf. https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing).
\r\n\r\nAccording to UK Copyright Law (cf. https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing), “[f]actors that have been identified by the courts as relevant in determining whether a particular dealing with a work is fair include:\r\n
This dataset contains tabular files with information about the usage of \"actually\" in contemporary British speech. We draw on two spoken corpora: (i) The demographically sampled part of the Spoken BNC1994 (Crowdy 1995) and (ii) the Spoken BNC2014 (Love et al. 2017). For both corpora, we list the usage rate observed for each speaker (total number of words produced, number of actually tokens, normalized frequency of actually expressed as per million words), along with information about the sex and age of the informant. In total, the dataset includes n = 1,408 speakers (Spoken BNC1994DS) and n = 668 speakers (Spoken BNC2014). For each corpus, we offer data tables with additional speaker meta-data. For a subset of the Spoken BNC1994DS (speakers with available information on gender and age; n = 886 speakers; n = 2,688 tokens), we also report on the position of actually in the clause (initial, medial, final), which was annotated manually.
\n\nRelated publication: Sönning, Lukas & Manfred Krug. 2022. Comparing study designs and down-sampling strategies in corpus analysis: The importance of speaker metadata in the BNCs of 1994 and 2014. In Ole Schützler & Julia Schlüter (eds.), Data and methods in corpus linguistics: Comparative approaches, 127-159. Cambridge: Cambridge University Press. https://doi.org/10.1017/9781108589314.006
"},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2022-07-08"}}]},{"typeName":"subject","multiple":true,"typeClass":"controlledVocabulary","value":["Arts and Humanities"]},{"typeName":"keyword","multiple":true,"typeClass":"compound","value":[{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"actually"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"grammaticalization"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"pragmaticalization"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"BNC"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"corpus data"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"spoken"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"frequency"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"position"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"discourse marker"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"English"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"British English"}}]},{"typeName":"publication","multiple":true,"typeClass":"compound","value":[{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Sönning, Lukas & Manfred Krug. 2022. Comparing study designs and down-sampling strategies in corpus analysis: The importance of speaker metadata in the BNCs of 1994 and 2014. In Ole Schützler & Julia Schlüter (eds.), Data and methods in corpus linguistics: Comparative approaches, 127-159. Cambridge: Cambridge University Press. doi:10.1017/9781108589314"},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"doi"},"publicationIDNumber":{"typeName":"publicationIDNumber","multiple":false,"typeClass":"primitive","value":"10.1017/9781108589314.006"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://doi.org/10.1017/9781108589314.006"}}]},{"typeName":"language","multiple":true,"typeClass":"controlledVocabulary","value":["English"]},{"typeName":"producer","multiple":true,"typeClass":"compound","value":[{"producerName":{"typeName":"producerName","multiple":false,"typeClass":"primitive","value":"University of Bamberg"},"producerURL":{"typeName":"producerURL","multiple":false,"typeClass":"primitive","value":"https://www.uni-bamberg.de/en/"}}]},{"typeName":"contributor","multiple":true,"typeClass":"compound","value":[{"contributorType":{"typeName":"contributorType","multiple":false,"typeClass":"controlledVocabulary","value":"Other"},"contributorName":{"typeName":"contributorName","multiple":false,"typeClass":"primitive","value":"Stich, Felicia"}}]},{"typeName":"distributor","multiple":true,"typeClass":"compound","value":[{"distributorName":{"typeName":"distributorName","multiple":false,"typeClass":"primitive","value":"The Tromsø Repository of Language and Linguistics (TROLLing)"},"distributorAbbreviation":{"typeName":"distributorAbbreviation","multiple":false,"typeClass":"primitive","value":"TROLLing"},"distributorURL":{"typeName":"distributorURL","multiple":false,"typeClass":"primitive","value":"https://trolling.uit.no/"}}]},{"typeName":"depositor","multiple":false,"typeClass":"primitive","value":"Sönning, Lukas"},{"typeName":"dateOfDeposit","multiple":false,"typeClass":"primitive","value":"2021-01-15"},{"typeName":"timePeriodCovered","multiple":true,"typeClass":"compound","value":[{"timePeriodCoveredStart":{"typeName":"timePeriodCoveredStart","multiple":false,"typeClass":"primitive","value":"1991"},"timePeriodCoveredEnd":{"typeName":"timePeriodCoveredEnd","multiple":false,"typeClass":"primitive","value":"1993"}},{"timePeriodCoveredStart":{"typeName":"timePeriodCoveredStart","multiple":false,"typeClass":"primitive","value":"2012"},"timePeriodCoveredEnd":{"typeName":"timePeriodCoveredEnd","multiple":false,"typeClass":"primitive","value":"2016"}}]},{"typeName":"dateOfCollection","multiple":true,"typeClass":"compound","value":[{"dateOfCollectionStart":{"typeName":"dateOfCollectionStart","multiple":false,"typeClass":"primitive","value":"1991"},"dateOfCollectionEnd":{"typeName":"dateOfCollectionEnd","multiple":false,"typeClass":"primitive","value":"1993"}},{"dateOfCollectionStart":{"typeName":"dateOfCollectionStart","multiple":false,"typeClass":"primitive","value":"2012"},"dateOfCollectionEnd":{"typeName":"dateOfCollectionEnd","multiple":false,"typeClass":"primitive","value":"2016"}}]},{"typeName":"kindOfData","multiple":true,"typeClass":"primitive","value":["corpus data","observational data"]},{"typeName":"software","multiple":true,"typeClass":"compound","value":[{"softwareName":{"typeName":"softwareName","multiple":false,"typeClass":"primitive","value":"CQPweb"},"softwareVersion":{"typeName":"softwareVersion","multiple":false,"typeClass":"primitive","value":"3.3.7"}},{"softwareName":{"typeName":"softwareName","multiple":false,"typeClass":"primitive","value":"rcqp (R package)"},"softwareVersion":{"typeName":"softwareVersion","multiple":false,"typeClass":"primitive","value":"0.5"}}]},{"typeName":"dataSources","multiple":true,"typeClass":"primitive","value":["[BNC1994]: The British National Corpus, version 3 (BNC XML Edition). 2007. Distributed by Bodleian Libraries, University of Oxford, on behalf of the BNC Consortium. URL: http://www.natcorp.ox.ac.uk/.","[Spoken BNC2014]: Love, Robbie, Claire Dembry, Andrew Hardie, Vaclav Brezina & Tony McEnery. 2017. The Spoken BNC2014: Designing and building a spoken corpus of everyday conversations. International Journal of Corpus Linguistics, 22(3), 319–344."]}]},"geospatial":{"displayName":"Geospatial Metadata","name":"geospatial","fields":[{"typeName":"geographicCoverage","multiple":true,"typeClass":"compound","value":[{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"United Kingdom"}}]}]}},"files":[{"label":"00_ReadMe_actually.txt","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87577,"persistentId":"doi:10.18710/A3SATC/A6OQZY","pidURL":"https://doi.org/10.18710/A3SATC/A6OQZY","filename":"00_ReadMe_actually.txt","contentType":"text/plain","filesize":10079,"storageIdentifier":"S3://2002-yellow-dataverseno:1772aa337b0-c720fe7c7822","rootDataFileId":-1,"md5":"bb720cd95cb26210eab73d1eba569c62","checksum":{"type":"MD5","value":"bb720cd95cb26210eab73d1eba569c62"},"creationDate":"2021-01-22"}},{"label":"actually_data_1994.tab","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87508,"persistentId":"doi:10.18710/A3SATC/55P8B6","pidURL":"https://doi.org/10.18710/A3SATC/55P8B6","filename":"actually_data_1994.tab","contentType":"text/tab-separated-values","filesize":63772,"storageIdentifier":"S3://2002-yellow-dataverseno:1770657c667-cf1a3a6ea6a2","originalFileFormat":"text/csv","originalFormatLabel":"Comma Separated Values","originalFileSize":60828,"originalFileName":"actually_data_1994.csv","UNF":"UNF:6:JkxhHT/73pTRKCUE/NVqgw==","rootDataFileId":-1,"md5":"f1659ce430ca70f9fc9786ffe034e676","checksum":{"type":"MD5","value":"f1659ce430ca70f9fc9786ffe034e676"},"creationDate":"2021-01-15"}},{"label":"actually_data_1994_position.tab","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87522,"persistentId":"doi:10.18710/A3SATC/T3HUND","pidURL":"https://doi.org/10.18710/A3SATC/T3HUND","filename":"actually_data_1994_position.tab","contentType":"text/tab-separated-values","filesize":668268,"storageIdentifier":"S3://2002-yellow-dataverseno:1770676b9aa-739f93149fe6","originalFileFormat":"text/csv","originalFormatLabel":"Comma Separated Values","originalFileSize":668366,"originalFileName":"actually_data_1994_position.csv","UNF":"UNF:6:8XO1y870BWH4DwgOu9UCGA==","rootDataFileId":-1,"md5":"afbb6e5c5a8774814719ea6f6cf16d2d","checksum":{"type":"MD5","value":"afbb6e5c5a8774814719ea6f6cf16d2d"},"creationDate":"2021-01-15"}},{"label":"actually_data_2014.tab","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87511,"persistentId":"doi:10.18710/A3SATC/495BTG","pidURL":"https://doi.org/10.18710/A3SATC/495BTG","filename":"actually_data_2014.tab","contentType":"text/tab-separated-values","filesize":28204,"storageIdentifier":"S3://2002-yellow-dataverseno:1770657c8ba-e378a22f52cb","originalFileFormat":"text/csv","originalFormatLabel":"Comma Separated Values","originalFileSize":28542,"originalFileName":"actually_data_2014.csv","UNF":"UNF:6:NGpek+oF5WbD+YYyOZLk1A==","rootDataFileId":-1,"md5":"ac26d5cd29d9a5e07223fb49262cdb4b","checksum":{"type":"MD5","value":"ac26d5cd29d9a5e07223fb49262cdb4b"},"creationDate":"2021-01-15"}},{"label":"data_retrieval_1994.Rmd","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87521,"persistentId":"doi:10.18710/A3SATC/VUQJ2V","pidURL":"https://doi.org/10.18710/A3SATC/VUQJ2V","filename":"data_retrieval_1994.Rmd","contentType":"application/octet-stream","filesize":11142,"storageIdentifier":"S3://2002-yellow-dataverseno:17706656d28-be8d7248bb5b","rootDataFileId":-1,"md5":"3f6aa9565a00a139287a16c8e98d5877","checksum":{"type":"MD5","value":"3f6aa9565a00a139287a16c8e98d5877"},"creationDate":"2021-01-15"}},{"label":"data_retrieval_1994.html","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87516,"persistentId":"doi:10.18710/A3SATC/9GRG4O","pidURL":"https://doi.org/10.18710/A3SATC/9GRG4O","filename":"data_retrieval_1994.html","contentType":"text/html","filesize":772734,"storageIdentifier":"S3://2002-yellow-dataverseno:17706656c2c-b0e173b8c73c","rootDataFileId":-1,"md5":"4bdca17dccd0b6838e5bbbf9f8f0cb8a","checksum":{"type":"MD5","value":"4bdca17dccd0b6838e5bbbf9f8f0cb8a"},"creationDate":"2021-01-15"}},{"label":"data_retrieval_actually_2014.Rmd","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87520,"persistentId":"doi:10.18710/A3SATC/SIPXXW","pidURL":"https://doi.org/10.18710/A3SATC/SIPXXW","filename":"data_retrieval_actually_2014.Rmd","contentType":"application/octet-stream","filesize":6577,"storageIdentifier":"S3://2002-yellow-dataverseno:17706657395-a745357ee7e8","rootDataFileId":-1,"md5":"7e07701b16eb057583bb10a71f0adbfa","checksum":{"type":"MD5","value":"7e07701b16eb057583bb10a71f0adbfa"},"creationDate":"2021-01-15"}},{"label":"data_retrieval_actually_2014.html","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87517,"persistentId":"doi:10.18710/A3SATC/UFJ5S8","pidURL":"https://doi.org/10.18710/A3SATC/UFJ5S8","filename":"data_retrieval_actually_2014.html","contentType":"text/html","filesize":759904,"storageIdentifier":"S3://2002-yellow-dataverseno:177066572aa-1df568e8793f","rootDataFileId":-1,"md5":"304912c1f6c7eedadbe8e1de88f9dde4","checksum":{"type":"MD5","value":"304912c1f6c7eedadbe8e1de88f9dde4"},"creationDate":"2021-01-15"}},{"label":"data_retrieval_speaker_biodata_2014.Rmd","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87519,"persistentId":"doi:10.18710/A3SATC/ISMA0W","pidURL":"https://doi.org/10.18710/A3SATC/ISMA0W","filename":"data_retrieval_speaker_biodata_2014.Rmd","contentType":"application/octet-stream","filesize":5645,"storageIdentifier":"S3://2002-yellow-dataverseno:17706657a8c-7b7d8ba6bc99","rootDataFileId":-1,"md5":"33d451810ac7de5cfa55bec7dbcc10fb","checksum":{"type":"MD5","value":"33d451810ac7de5cfa55bec7dbcc10fb"},"creationDate":"2021-01-15"}},{"label":"data_retrieval_speaker_biodata_2014.html","restricted":false,"version":1,"datasetVersionId":3763,"dataFile":{"id":87518,"persistentId":"doi:10.18710/A3SATC/NN8RRF","pidURL":"https://doi.org/10.18710/A3SATC/NN8RRF","filename":"data_retrieval_speaker_biodata_2014.html","contentType":"text/html","filesize":749226,"storageIdentifier":"S3://2002-yellow-dataverseno:177066579ab-ae13fdd7e9e2","rootDataFileId":-1,"md5":"6ed6f58dad0f4b9f46c5b9bdb095179b","checksum":{"type":"MD5","value":"6ed6f58dad0f4b9f46c5b9bdb095179b"},"creationDate":"2021-01-15"}}],"citation":"Sönning, Lukas; Krug, Manfred, 2021, \"Actually in contemporary British speech: Data from the Spoken BNC corpora\", https://doi.org/10.18710/A3SATC, DataverseNO, V1, UNF:6:S8E8T69mmiEVrGGC0XPIrg== [fileUNF]"}}