Concessive constructions in varieties of English: Corpus datahttps://doi.org/10.18710/1JMFVRSchützler, OleDataverseNO2021-02-112023-09-29T04:32:59ZThe data were used in a corpus-based study that investigates the variation of concessive constructions across nine varieties of English. Concessive constructions are here taken to consist of a subordinate clause linked to a matrix clause using one of the three subordinating conjunctions 'although', 'though' or 'even though'. For each occurrence, the data contain information concerning its semantic properties, the position of the subordinate clause, the conjunction that was used, the finite or nonfinite status of the subordinate clause as well as its length. Further, each token is annotated for variety, mode of production (spoken vs. written) and genre (or text type). It is also possible to model the text frequencies of conjunctions and semantic subtypes, since in the respective data tables counts are given for each text in the corpora, along with the total word count per text.Arts and HumanitiesCorpus linguisticsConcessivesEnglishSubordinating conjunctionsEnglishSchützler, Ole. Monograph in preparation for publication. Concessive constructions in varieties of English.Schützler, Ole. 2020. 'Although'-constructions in varieties of English. World Englishes 39: 443–461., doi, 10.1111/weng.12484, https://doi.org/10.1111/weng.12484Schützler, Ole. 2018. Concessive constructions in varieties of English. University of Bamberg: unpublished postdoctoral thesis.Schützler, Ole. 2017. A corpus-based study of concessive conjunctions in three L1-varieties of English. In: Isabelle Buchstaller & Beat Siebenhaar (eds.), Language variation – European perspectives VI. Selected papers from the Eighth International Conference on Language Variation in Europe (ICLaVE 8), Leipzig, May 2015. Amsterdam / Philadelphia: John Benjamins. 173–184., doi, 10.1075/silv.19.11sch, https://doi.org/10.1075/silv.19.11sch2021-02-11Schützler, OleVetter, Fabian2021-01-20198820102013-04-012017-09-30Corpus dataNine components of The International Corpus of English: Great Britain, Ireland, Canada, Australia, Jamaica, Nigeria, India, Singapore, Hong KongUnited KingdomIrelandCanadaAustraliaJamaicaNigeriaIndiaSingaporeHong Kong<p>This dataset, "Concessive constructions in varieties of English: Corpus data", may be reused according to the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license as described here: <a href="https://creativecommons.org/licenses/by-nc/4.0/"
title="TermsOfUse" target="_blank">https://creativecommons.org/licenses/by-nc/4.0/</a>.</p>
<p></p>
<p>This dataset contains data from the International Corpus of English (ICE). The ICE license (cf. <a href="https://www.ice-corpora.uzh.ch/dam/jcr:7ae594b2-ee97-4935-8022-7d2d91b60be4/ICElicence_UZH.pdf"
title="Terms of Use" target="_blank">https://www.ice-corpora.uzh.ch/dam/jcr:7ae594b2-ee97-4935-8022-7d2d91b60be4/ICElicence_UZH.pdf</a>) and the file "Corpus_licences.pdf") includes the following conditions:</p>
<p>
<ul>
<li>“The Corpus must be used for non-profit academic research purposes only. […] The Licensee agrees not to reproduce or redistribute the Corpus or to use all or any part of the Corpus texts in any commercial product or service.”</li>
<li>“Publications based on the Corpus may include citations from texts only in a way which would be permitted under the fair dealings provision of copyright law.”</li>
<li>“If you publish a paper using any ICE corpus, please send a reference to ice@es.uzh.ch.”</li>
</ul>
</p>
<p></p>
<p>In this dataset, "Concessive constructions in varieties of English: Corpus data", the data files “concessives_1.csv”, “concessives_2.csv”, and “concessives_3.csv” contain statistical data / calculations based on nine national components of the ICE. In addition, the files contain</p>
<p>
<ul>
<li>the keywords which the ICE was searched for, and for each token</li>
<li>the genre indication used in ICE, and</li>
<li>the unique alpha-numeric identifier used in ICE.</li>
</ul>
</p>
<p>However, the files do not contain any coherent (parts of) utterances which the keywords were found in as all context was removed from the data files.
<p>According to UK Copyright Law (cf. <a href="https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing"
title="BNC" target="_blank">https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing</a>), “[f]actors that have been identified by the courts as relevant in determining whether a particular dealing with a work is fair include:
<ul>
<li>"does using the work affect the market for the original work? If a use of a work acts as a substitute for it, causing the owner to lose revenue, then it is not likely to be fair"</li>
<li>"is the amount of the work taken reasonable and appropriate? Was it necessary to use the amount that was taken? Usually only part of a work may be used”</li>
</ul>
<p></p>
<p>The extracts used in this present dataset may be said to represent fair dealing according to both these factors:
<ul>
<li>The extracted material does not affect the market for the original work, as it is unlikely that any researcher would refrain from using the ICE because of the availability of the extracted material contained in the present dataset.</li>
<li>The amount of the extracted work is reasonable and appropriate as it was necessary to carry out the study, and as it is necessary to replicate the study. Also, the extracted material does not even contain the context for the keywords, and publishing the data files does therefore not infringe the copyright of the original IPR holders.</li>
</ul></p>