This dataset, "Modal verbs of strong obligation in Scottish Standard English: Corpus data", may be reused according to the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license as described here: https://creativecommons.org/licenses/by-nc/4.0.
This dataset contains data from two components of the International Corpus of English (ICE), namely the British and the Scottish components.
ICE-Scotland has partly been (and will continue to be) published under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Germany (CC BY-NC-SA 3.0 DE) licence.
The license for ICE-GB (as given in the file "Corpus_licences.pdf") includes the following conditions:
- “The Corpus […] must be used for non-profit educational purposes only. The licence cannot be transferred, lent, or resold.”
- “Copyright in all ICE-GB Texts is retained by the original copyright holders.”
- “The Licensee agrees not to reproduce or redistribute the ICE-GB Texts or to use all or any part of the ICE-GB Texts in any commercial product or service.”
- “Publications based on the ICE-GB Corpus may include citations from ICE-GB Texts only in a way which would be permitted under the fair dealings provision of copyright law.”
In this dataset, "Modal verbs of strong obligation in Scottish Standard English: Corpus data", the data file “modals.csv” contains statistical data / calculations based on two national components of the ICE. In addition, the files contain
- the keywords which the ICE was searched for, and for each token
- the genre indication used in ICE, and
- the unique alpha-numeric identifier used in ICE.
However, the files do not contain any coherent (parts of) utterances which the keywords were found in as all context was removed from the data files.
According to UK Copyright Law (cf. https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing), “[f]actors that have been identified by the courts as relevant in determining whether a particular dealing with a work is fair include:
- "does using the work affect the market for the original work? If a use of a work acts as a substitute for it, causing the owner to lose revenue, then it is not likely to be fair"
- "is the amount of the work taken reasonable and appropriate? Was it necessary to use the amount that was taken? Usually only part of a work may be used”
The extracts used in this present dataset may be said to represent fair dealing according to both these factors:
- The extracted material does not affect the market for the original work, as it is unlikely that any researcher would refrain from using the ICE because of the availability of the extracted material contained in the present dataset.
- The amount of the extracted work is reasonable and appropriate as it was necessary to carry out the study, and as it is necessary to replicate the study. Also, the extracted material does not even contain the context for the keywords, and publishing the data files does therefore not infringe the copyright of the original IPR holders.