|
Persistent Identifier
|
doi:10.18710/1JMFVR |
|
Publication Date
|
2021-02-11 |
|
Title
| Concessive constructions in varieties of English: Corpus data |
|
Author
| Schützler, OleUniversity of BambergORCID0000-0001-8868-0191 |
|
Point of Contact
|
Use email button above to contact.
Schützler, Ole (University of Bamberg) |
|
Description
| The data were used in a corpus-based study that investigates the variation of concessive constructions across nine varieties of English. Concessive constructions are here taken to consist of a subordinate clause linked to a matrix clause using one of the three subordinating conjunctions 'although', 'though' or 'even though'. For each occurrence, the data contain information concerning its semantic properties, the position of the subordinate clause, the conjunction that was used, the finite or nonfinite status of the subordinate clause as well as its length. Further, each token is annotated for variety, mode of production (spoken vs. written) and genre (or text type). It is also possible to model the text frequencies of conjunctions and semantic subtypes, since in the respective data tables counts are given for each text in the corpora, along with the total word count per text. (2021-01-20) |
|
Subject
| Arts and Humanities |
|
Keyword
| Corpus linguistics
Concessives
English
Subordinating conjunctions |
|
Related Publication
| Schützler, Ole. 2023. Concessive constructions in varieties of English (Language Variation 9). Berlin: Language Science Press. doi 10.5281/zenodo.8375010 https://doi.org/10.5281/zenodo.8375010
Schützler, Ole. 2020. 'Although'-constructions in varieties of English. World Englishes 39: 443–461. doi 10.1111/weng.12484 https://doi.org/10.1111/weng.12484
Schützler, Ole. 2018. Concessive constructions in varieties of English. University of Bamberg: unpublished postdoctoral thesis.
Schützler, Ole. 2017. A corpus-based study of concessive conjunctions in three L1-varieties of English. In: Isabelle Buchstaller & Beat Siebenhaar (eds.), Language variation – European perspectives VI. Selected papers from the Eighth International Conference on Language Variation in Europe (ICLaVE 8), Leipzig, May 2015. Amsterdam / Philadelphia: John Benjamins. 173–184. doi 10.1075/silv.19.11sch https://doi.org/10.1075/silv.19.11sch |
|
Language
| English |
|
Producer
| University of Bamberg https://www.uni-bamberg.de/en/ |
|
Contributor
| Other: Vetter, Fabian |
|
Distributor
| The Tromsø Repository of Language and Linguistics (TROLLing) (TROLLing) https://trolling.uit.no/ |
|
Depositor
| Schützler, Ole |
|
Deposit Date
| 2021-01-20 |
|
Time Period
| Start Date: 1988; End Date: 2010 |
|
Date of Collection
| Start Date: 2013-04-01; End Date: 2017-09-30 |
|
Data Type
| Corpus data |
|
Software
| AntConc, Version: 3.5.7
R, Version: 3.6.2
RStudio, Version: 1.2.5033
R package 'brms', Version: 3.6.2
R package 'rstan', Version: 2.19.3
R package 'StanHeaders', Version: 2.21.0-1
R package 'lattice', Version: 0.20-38
R-package 'latticeExtra', Version: 0.6-29 |
|
Related Material
| Project repository on the Open Science Framework: https://osf.io/m4tfc/ |
|
Data Source
| This dataset contains data from the International Corpus of English (ICE). The ICE license (cf. https://www.ice-corpora.uzh.ch/dam/jcr:7ae594b2-ee97-4935-8022-7d2d91b60be4/ICElicence_UZH.pdf) and the file "Corpus_licences.pdf") includes the following conditions:
- “The Corpus must be used for non-profit academic research purposes only. […] The Licensee agrees not to reproduce or redistribute the Corpus or to use all or any part of the Corpus texts in any commercial product or service.”
- “Publications based on the Corpus may include citations from texts only in a way which would be permitted under the fair dealings provision of copyright law.”
- “If you publish a paper using any ICE corpus, please send a reference to ice@es.uzh.ch.”
In this dataset, "Concessive constructions in varieties of English: Corpus data", the data files “concessives_1.csv”, “concessives_2.csv”, and “concessives_3.csv” contain statistical data / calculations based on nine national components of the ICE. In addition, the files contain
- the keywords which the ICE was searched for, and for each token
- the genre indication used in ICE, and
- the unique alpha-numeric identifier used in ICE.
However, the files do not contain any coherent (parts of) utterances which the keywords were found in as all context was removed from the data files.
According to UK Copyright Law (cf. https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing), “[f]actors that have been identified by the courts as relevant in determining whether a particular dealing with a work is fair include:
- "does using the work affect the market for the original work? If a use of a work acts as a substitute for it, causing the owner to lose revenue, then it is not likely to be fair"
- "is the amount of the work taken reasonable and appropriate? Was it necessary to use the amount that was taken? Usually only part of a work may be used”
The extracts used in this present dataset may be said to represent fair dealing according to both these factors:
- The extracted material does not affect the market for the original work, as it is unlikely that any researcher would refrain from using the ICE because of the availability of the extracted material contained in the present dataset.
- The amount of the extracted work is reasonable and appropriate as it was necessary to carry out the study, and as it is necessary to replicate the study. Also, the extracted material does not even contain the context for the keywords, and publishing the data files does therefore not infringe the copyright of the original IPR holders.
|
|
Documentation and Access to Sources
| https://www.ice-corpora.uzh.ch/en.html [ICE-website at the University of Zurich] Kirk, John & Gerald Nelson. 2018. The International Corpus of English project: A progress report. World Englishes 34(4): 697–716. https://doi.org/10.1111/weng.12350 |