This file is part of "Replication Data for: A multivariate account of particle alternation after bare-form try in native varieties of English".
This file has already been deleted (or replaced) in the current version. It may not be edited.
Restricting limits access to published files. People who want to use the restricted files can request access by default. If you disable request access, you must add information about access to the Terms of Access field.
The selected file or files have already been published. Contact an administrator to change the embargo date or reason of the file or files.
The file will be deleted after you click on the Delete button.
Files will not be removed from previously published versions of the dataset.
Please select one or more files.
Share this file on your favorite social media networks.
This dataset is made available under the following terms. Please confirm and/or complete the information needed below in order to continue.
Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.
Custom Dataset Terms — the following Custom Dataset Terms have been defined for this dataset.
This dataset, "Replication Data for: A multivariate account of particle alternation after bare-form try in native varieties of English" (henceforth: "Dataset"), may be reused according to the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license as described here: https://creativecommons.org/licenses/by-nc/4.0.
This Dataset contains data from the following sources:
BNC: The British National Corpus. Examples of usage taken from the British National Corpus were obtained under the terms of the BNC End User Licence (see http://www.natcorp.ox.ac.uk/docs/licence.html or the file "BNC_End_User_Licence.pdf" included in this Dataset. Copyright in the individual texts cited resides with the original IPR holders. For information and licensing conditions relating to the BNC, please see the Terms tab on the landing page of the Dataset, and the BNC web site at http://www.natcorp.ox.ac.uk/.
Section "2 Terms of the Licence Granted to the Licensee" of the BNC End User Licence states among otherthings that "(f) [t]here is no restriction on the use of the Licensee's Results except that the Licensee may not publish in print or electronic form or exploit commercially in any form whatsoever any extracts from the BNC Processed Material other than those permitted under the fair dealings provision of copyright law."
In this Dataset, the data file "bnc.csv" contains the following information:
This means that the file does not contain any coherent (parts of) utterances which the keywords were found in as all context was removed from the data file. Therefore, publishing this data file is considered to be permitted under the fair dealings provision of copyright law; see details in section "Fair dealing" below.
COCA: Corpus of Contemporary American English. COCA does not provide an (openly accessible) end user license agreement. However, on their webpage (cf. https://www.english-corpora.org/copyright.asp; see also the file "COCA_Note_on_Copyright.pdf" included in this Dataset), they mention that the use of their source texts is "strictly for academic research, and is purely non-commercial". This may be interpreted as also the reuse of text from COCA being allowed for non-commercial purposes only. On the same webpage, COCA also provides evidence of their use and dissemination of the text sources being within the bounds of US Fair Use Law.
In this Dataset, the data file "coca.csv" contains the following information:
This means that the file does not contain any coherent (parts of) utterances which the keywords were found in as all context was removed from the data file. Therefore, publishing this data file is considered to be permitted under the fair dealings provision of copyright law; see details in section "Fair use" below.
GloWbE: Corpus of Global Web-Based English. GloWbE does not provide an (openly accessible) end user license agreement. However, on their webpage (cf. https://www.english-corpora.org/copyright.asp; see also the file "COCA_Note_on_Copyright.pdf" included in this Dataset), they mention that the use of their source texts is "strictly for academic research, and is purely non-commercial". This may be interpreted as also the reuse of text from GloWbE being allowed for non-commercial purposes only. On the same webpage, GloWbE also provides evidence of their use and dissemination of the text sources being within the bounds of US Fair Use Law.
In this Dataset, the data file "glowbe.csv" contains the following information:
ICE: The International Corpus of English, including the following components:
ICE-CAN and ICE-IRE were used under the general ICE License Agreement; see https://www.ice-corpora.uzh.ch/dam/jcr:7ae594b2-ee97-4935-8022-7d2d91b60be4/ICElicence_UZH.pdf or the file "ICE_License_Agreement.pdf" included in this Dataset.
ICE-GB was used under the ICE-GB License Agreement; see the file "ICE-GB_License_Agreement.pdf" included in this Dataset.
ICE-NZ was used under the ICE-NZ License Agreement; see the file "ICE-NZ_License_Agreement.pdf" included in this Dataset.
In this Dataset, the data file "ice.csv" contains the following information:
This means that the file only contains very limited excerpts from the works that are the bases for the ICE components that were used. Therefore, publishing this data file is considered to be permitted under the fair dealings provision of copyright law; see details in section "Fair dealing" below.
While no explicit, separate license agreement for ICE-AUS exists, its use and the publication of data from ICE-AUS as represented in this Dataset correspond to the use and publication of the data extracted from the other ICE components, and thus are considered as qualifying as fair dealing.
Fair dealing:
According to UK Copyright Law (cf. https://www.gov.uk/guidance/exceptions-to-copyright#fair-dealing), “[f]actors that have been identified by the courts as relevant in determining whether a particular dealing with a work is fair include:
The corpus extracts used in this Dataset may be said to represent fair dealing according to both of these factors:
Fair use:
According to US Copyright Act (cf. https://www.copyright.gov/fair-use/more-info.html), "Fair use is a legal doctrine that promotes freedom of expression by permitting the unlicensed use of copyright-protected works in certain circumstances". The Corpus of Contemporary American English (COCA; cf. https://www.english-corpora.org/copyright.asp; see also the file "COCA_Note_on_Copyright.pdf" included in this Dataset) provides an extended discussion of why they believe that their use of the texts in COCA is within the bounds of US Fair Use Law. These arguments may also be applied to other corpora that have been used in this Dataset. Below, the discussion by COCA is adapted to the data files included in this Dataset:
The following are the four criteria used to determine whether materials fall under the provisions of the Fair Use Law:
Criteria: The amount and substantiality of the portion taken
Criteria: The purpose and character of the use
Criteria: The nature of the copyrighted work
Criteria: The effect of the use upon the potential market
Use the Download URL in a Wget command or a download manager to download this package file. Download via web browser is not recommended. User Guide - Downloading a Dataverse Package via URL
https://dataverse.no/api/access/datafile/
Please confirm and/or complete the information needed below in order to request access to files in this dataset.
???file.mapData.unpublished.message???
Provenance is a record of the origin of your data file and any transformations it has been through. Upload a JSON file from a provenance capture tool to generate a graph of your data's provenance. For more information, please refer to our User Guide.
File must be JSON format and follow the W3C standard.
You may also add information documenting the history of your data file, including how it was created, how it has changed, and who has worked with it.
You need to Log In to request access.
This file is restricted and you may not compute on it because you have not been granted access.
DataverseNO Support
Please fill this out to prove you are not a robot.