|
Persistent Identifier
|
doi:10.18710/GIP0PM |
|
Publication Date
|
2021-02-19 |
|
Title
| Modal verbs of strong obligation in Scottish Standard English: Corpus data |
|
Author
| Schützler, OleUniversity of BambergORCID0000-0001-8868-0191
Herzky, JennyUniversity of Bamberg |
|
Point of Contact
|
Use email button above to contact.
Schützler, Ole (University of Bamberg) |
|
Description
| The dataset contains instances of the (semi-)modal verbs 'must', 'have to', 'need to' and '(have) got to' from nineteen written and spoken genres in the Scottish and British components of the International Corpus of English (ICE-SCO and ICE-GB). Nonfinite forms (e.g. 'will have to', 'might need to'), interrogatives, past-tense forms, as well as negations were excluded (partly through non-inclusion in the search, partly manually), as were all epistemic instances (predominantly of 'must') and non-obligation meanings of 'got to' (e.g. 'They got to be friends'). The sentences attached to each occurrence were manually annotated for grammatical person (of the subject) and source of obligation (objective vs. subjective). (2021-02-06) |
|
Subject
| Arts and Humanities |
|
Keyword
| Scottish Standard English
Modal verbs
Obligation |
|
Related Publication
| Schützler, O., & Herzky, J. (2022). Modal verbs of strong obligation in Scottish Standard English. English Language & Linguistics, 26(1), 133-159. doi:10.1017/S1360674321000071 doi 10.1017/S1360674321000071 https://doi.org/10.1017/S1360674321000071 |
|
Language
| English |
|
Producer
| University of Bamberg https://www.uni-bamberg.de/en/
German Research Foundation (DFG) https://www.dfg.de/en/index.jsp |
|
Contributor
| Other: Li, Zeyu |
|
Funding Information
| German Research Foundation: SCHU 3250/1-1 |
|
Distributor
| The Tromsø Repository of Language and Linguistics (TROLLing) (TROLLing) https://trolling.uit.no/ |
|
Depositor
| Schützler, Ole |
|
Deposit Date
| 2021-02-06 |
|
Time Period
| Start Date: 1989; End Date: 2018 |
|
Date of Collection
| Start Date: 2019-01-01; End Date: 2019-12-01 |
|
Data Type
| Corpus data |
|
Software
| AntConc, Version: 3.5.7
R, Version: 3.6.2
RStudio, Version: 1.2.5033
R package 'brms', Version: 3.6.2
R package 'rstan', Version: 2.19.3
R package 'StanHeaders', Version: 2.21.0-1
R package 'lattice', Version: 0.20-38 |
|
Related Material
| Documentation of analysis: https://osf.io/aq2r5/ |
|
Data Source
| This dataset contains data from two components of the International Corpus of English (ICE), namely the British and the Scottish components.
ICE-Scotland has partly been (and will continue to be) published under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Germany (CC BY-NC-SA 3.0 DE) licence.
The license for ICE-GB (as given in the file "Corpus_licences.pdf") includes the following conditions:
- “The Corpus […] must be used for non-profit educational purposes only. The licence cannot be transferred, lent, or resold.”
- “Copyright in all ICE-GB Texts is retained by the original copyright holders.”
- “The Licensee agrees not to reproduce or redistribute the ICE-GB Texts or to use all or any part of the ICE-GB Texts in any commercial product or service.”
- “Publications based on the ICE-GB Corpus may include citations from ICE-GB Texts only in a way which would be permitted under the fair dealings provision of copyright law.”
In this dataset, "Modal verbs of strong obligation in Scottish Standard English: Corpus data", the data file “modals.csv” contains statistical data / calculations based on two national components of the ICE. In addition, the files contain
- the keywords which the ICE was searched for, and for each token
- the genre indication used in ICE, and
- the unique alpha-numeric identifier used in ICE.
However, the files do not contain any coherent (parts of) utterances which the keywords were found in as all context was removed from the data files.
According to UK Copyright Law (cf. https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing), “[f]actors that have been identified by the courts as relevant in determining whether a particular dealing with a work is fair include:
- "does using the work affect the market for the original work? If a use of a work acts as a substitute for it, causing the owner to lose revenue, then it is not likely to be fair"
- "is the amount of the work taken reasonable and appropriate? Was it necessary to use the amount that was taken? Usually only part of a work may be used”
The extracts used in this present dataset may be said to represent fair dealing according to both these factors:
- The extracted material does not affect the market for the original work, as it is unlikely that any researcher would refrain from using the ICE because of the availability of the extracted material contained in the present dataset.
- The amount of the extracted work is reasonable and appropriate as it was necessary to carry out the study, and as it is necessary to replicate the study. Also, the extracted material does not even contain the context for the keywords, and publishing the data files does therefore not infringe the copyright of the original IPR holders.
|
|
Documentation and Access to Sources
| https://www.ice-corpora.uzh.ch/en.html [ICE-website at the University of Zurich]
Kirk, John & Gerald Nelson. 2018. The International Corpus of English project: A progress report. World Englishes 34(4): 697–716. https://doi.org/10.1111/weng.12350
Nelson, Gerald, Sean Wallis & Bas Aarts. 2002. Exploring Natural Language. Working with the British Component of the International Corpus of English. Amsterdam: John Benjamins.
Schützler, Ole, Ulrike Gut & Robert Fuchs. 2017. New perspectives on Scottish Standard English: Introducing the Scottish component of the International Corpus of English. In: Joan C. Beal & Sylvie Hancil (Hg.), Perspectives on northern Englishes. Berlin: Mouton de Gruyter [TiEL 96]. 273–302. https://doi.org/10.1515/9783110450903-012 |