The data contained in this dataset originate from the following sources:
Habla Culta de Madrid: Esgueva, Manuel, y Margarita Cantarero (eds.). 1981. El habla de la ciudad de Madrid. Materiales para su estudio. Madrid: Consejo Superior de Investigaciones Científicas. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
CREA Oral: REAL ACADEMIA ESPAÑOLA: Banco de datos (CREA) [online]. Corpus de referencia del español actual. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
CORLEC: Corpus Oral de Referencia de la Lengua Española Contemporánea. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
Val.Es.Co: Valencia.Español.Coloquial. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
COSER: Inés Fernández-Ordóñez (dir.) (2005-): Corpus Oral y Sonoro del Español Rural. [consulted multiple times in the period between 2016-01-01 and 2022-08-31] COSER was used under the COSER Terms of Use, which among other things require that COSER resources be used only for teaching or research purposes.
PRESEEA: Proyecto para el Estudio Sociolingüístico del Español de España y de América. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
C-Oral-Rom: Integrated reference corpora for spoken romance languages. Multimedia edition; tools of analysis; standard linguistic measures for validation in HLT. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
COLAm: Corpus Oral de Lenguaje Adolescente - Madrid. [consulted multiple times in the period between 2016-01-01 and 2022-08-31] COLAm was reused under a CLARIN ACA-NC license.
CORPES XXI: REAL ACADEMIA ESPAÑOLA: Banco de datos (CORPES XXI) [online]. Corpus del Español del Siglo XXI. [consulted multiple times in the period between 2016-01-01 and 2022-08-31]
CORMA: Corpus Oral de Madrid, Enghels R., De Latte F., L. Roels & E. Azofra, unpublished corpus. [consulted multiple times in the period between 2020-01-01 and 2022-08-31]
The dataset contains only values for indicator variables derived from searches in these sources; it contains no actual text extracts. The reuse (including redistribution) of these data is therefore permitted under the exceptions in intellectual property rights (IPR) and database protection regulations, such as fair use (USA; cf. US Copyright Act), fair dealing (UK; cf. Exceptions to copyright), and "sitatretten" (Norway; cf. § 29 of Åndsverkloven).