<?xml version='1.0' encoding='UTF-8'?><codeBook xmlns="ddi:codebook:2_5" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="ddi:codebook:2_5 https://ddialliance.org/Specification/DDI-Codebook/2.5/XMLSchema/codebook.xsd" version="2.5"><docDscr><citation><titlStmt><titl>Replication Data for: Perceiving and identifying vowels in regional accents of English: Evidence from Dutch- and Spanish-speaking L2 listeners</titl><IDNo agency="DOI">doi:10.18710/FEC2BO</IDNo></titlStmt><distStmt><distrbtr source="archive">DataverseNO</distrbtr><distDate>2026-01-29</distDate></distStmt><verStmt source="archive"><version date="2026-01-29" type="RELEASED">1</version></verStmt><biblCit>Verbeke, Gil; Escudero, Paola; Mitterer, Holger; Simon, Ellen, 2026, "Replication Data for: Perceiving and identifying vowels in regional accents of English: Evidence from Dutch- and Spanish-speaking L2 listeners", https://doi.org/10.18710/FEC2BO, DataverseNO, V1</biblCit></citation></docDscr><stdyDscr><citation><titlStmt><titl>Replication Data for: Perceiving and identifying vowels in regional accents of English: Evidence from Dutch- and Spanish-speaking L2 listeners</titl><IDNo agency="DOI">doi:10.18710/FEC2BO</IDNo></titlStmt><rspStmt><AuthEnty affiliation="Ghent University">Verbeke, Gil</AuthEnty><AuthEnty affiliation="Western Sydney University">Escudero, Paola</AuthEnty><AuthEnty affiliation="University of Malta">Mitterer, Holger</AuthEnty><AuthEnty affiliation="Ghent University">Simon, Ellen</AuthEnty></rspStmt><prodStmt><producer abbr="UGent">Ghent University</producer><prodDate>2025</prodDate><prodPlac>Flanders, Belgium</prodPlac><software version="4.4.1">R</software><software version="2024.04.2+764">R Studio</software><software version="6.4.18">Praat</software><software version="2023.1.2">PsychoPy</software><software version="16.76">MS Excel</software><grantNo agency="Fonds Wetenschappelijk Onderzoek - Vlaanderen (FWO)">1178623N</grantNo><grantNo agency="Fonds Wetenschappelijk Onderzoek - Vlaanderen (FWO)">K253023N</grantNo><grantNo agency="Fonds Wetenschappelijk Onderzoek - Vlaanderen (FWO)">K131425N</grantNo></prodStmt><distStmt><distrbtr source="archive">DataverseNO</distrbtr><distrbtr abbr="TROLLing" URI="https://trolling.uit.no/">The Tromsø Repository of Language and Linguistics (TROLLing)</distrbtr><contact affiliation="Ghent University" email="Gil.Verbeke@Ugent.be">Verbeke, Gil</contact><depositr>Verbeke, Gil</depositr><depDate>2025-05-03</depDate></distStmt><holdings URI="https://doi.org/10.18710/FEC2BO"/></citation><stdyInfo><subject><keyword xml:lang="en">Arts and Humanities</keyword><keyword>English vowels</keyword><keyword>L2 perception</keyword><keyword>acoustic similarity</keyword><keyword>perceived similarity</keyword><keyword>vowel identification</keyword><keyword>English as a Foreign Language</keyword></subject><abstract date="2025-07-11">&lt;p>&lt;b>Dataset abstract&lt;/b>&lt;/p>

&lt;p>This dataset contains the results of a study on cross-language and second-language vowel perception in Dutch-speaking and Spanish-speaking learners of English. The dataset includes both acoustic similarity predictions and behavioral data from two perceptual tasks. &lt;/p>

&lt;p>For the acoustic comparisons, Linear Discriminant Analysis (LDA) models were trained on native vowel data from Dutch and Spanish speakers, recorded in earlier studies. The models were tested on English vowel tokens produced by speakers of Southern British English (S.Eng), Northern British English (N.Eng), and Australian English (AusE), and predict how similar these English vowels are to Dutch and Spanish vowels based on acoustic properties, such as formant frequencies and vowel duration.&lt;/p>

&lt;p>In addition to these acoustic predictions, the dataset includes behavioral responses collected during two experimental sessions. In the first session, 40 L1 Dutch and 40 L1 Spanish participants completed (i) a demographic and language background questionnaire, (ii) a cross-language vowel categorization task consisting of 210 trials, and (iii) a general vocabulary test (LexTALE; Lemhöfer &amp; Broersma, 2012). During the cross-language categorization task, participants listened to English vowels produced in the three accents and indicated which vowel from their native language was most similar to that vowel, followed by a goodness-of-fit rating (i.e., how good an example of that vowel the sound was). In the second session, the same participants completed a second-language vowel categorization task with the same 210 trials, in which they were asked to identify which English vowel they heard and to rate how good an example of that vowel it was.&lt;/p>

&lt;p> The participants’ cross-language categorization responses were compared to the acoustic similarity scores from the LDA models, to assess how perceived (phonetic) similarity and acoustic similarity align. Participants' identification accuracy in the second-language task was analyzed using a mixed-effects logistic regression model. The repository includes all raw and processed data, the R code used for statistical analysis, and the model outputs.&lt;/p></abstract><abstract date="2025-01-26">&lt;p>&lt;b>Article abstract&lt;/b>&lt;/p>
&lt;p>
This study examines how L2 English listeners perceive and categorize vowels produced in three regional accents of English: Southern British (S.Eng), Northern British (N.Eng), and Australian English (AusE). Specifically, we investigate how L1 speakers of Belgian Dutch and European Spanish classify these vowels in terms of their native vowel categories, and how such perceptual classifications relate to acoustic similarity between L1-L2 vowels and L2 vowel identification accuracy. To quantify cross-language acoustic similarity and predict which L2 vowel contrasts would be perceptually challenging, Linear Discriminant Analysis (LDA) models were trained on Dutch and Spanish vowel data and tested on English vowel data. 40 Dutch-speaking and 40 Spanish-speaking participants then completed a cross-language categorization task and second-language vowel identification task using naturally produced /CVC/ syllables. The results demonstrate that LDA-based acoustic similarity
largely predicts cross-language perception, although certain vowel categorization
patterns point to differences in acoustic cue-weighting between the LDA models and
participants. Compared to Spanish listeners, Dutch listeners’ classifications showed
greater divergence from the LDA model, likely reflecting the denser vowel inventory of
Dutch and the resulting increase in category competition. Additionally, participants’
cross-language vowel categorization responses predicted their L2 vowel identification
accuracy. That is, L2 vowels consistently mapped onto a (single) different L1 category
with high goodness-of-fit were more likely to be identified correctly. Identification
accuracy was highest for S.Eng vowels, aligning with participants’ greater self-reported familiarity with that accent. Together, our findings highlight the complex interplay between cross-language similarity, vowel inventory and accent familiarity in shaping L2 perception. &lt;/p></abstract><sumDscr><collDate cycle="P1" event="start" date="2025-03-01">2025-03-01</collDate><collDate cycle="P1" event="end" date="2025-03-31">2025-03-31</collDate><nation>Belgium</nation><dataKind>sociodemographic and linguistic background information</dataKind><dataKind>LexTALE scores</dataKind><dataKind>experimental data</dataKind><dataKind>cross-language vowel categorization data</dataKind><dataKind>second-language vowel categorization data</dataKind></sumDscr></stdyInfo><method><dataColl><sources><dataSrc>&lt;ul>
  &lt;li>
    Recordings of European Spanish speakers were sourced from the following study: 
    Chládková, K., Escudero, P., &amp; Boersma, P. (2011). Context-specific acoustic differences between Peruvian and Iberian Spanish vowels. 
    &lt;i>The Journal of the Acoustical Society of America, 130(1), 416–428.&lt;/i> 
    &lt;a href="https://doi.org/10.1121/1.3592242" target="_blank">https://doi.org/10.1121/1.3592242&lt;/a>
  &lt;/li>
  &lt;li>
    Recordings of Belgian Dutch speakers were sourced from the following study: 
    Adank, P., Van Hout, R., &amp; Velde, H. V. D. (2007). An acoustic description of the vowels of northern and southern standard Dutch II: Regional varieties. 
    &lt;i>The Journal of the Acoustical Society of America, 121(2), 1130–1141.&lt;/i> 
    &lt;a href="https://doi.org/10.1121/1.2409492" target="_blank">https://doi.org/10.1121/1.2409492&lt;/a>
  &lt;/li>
  &lt;li>
    Recordings of Australian English speakers were sourced from the following study: 
    Elvin, J., Williams, D., &amp; Escudero, P. (2016). Dynamic acoustic properties of monophthongs and diphthongs in Western Sydney Australian English. 
    &lt;i>The Journal of the Acoustical Society of America, 140(1), 576–581.&lt;/i> 
    &lt;a href="https://doi.org/10.1121/1.4952387" target="_blank">https://doi.org/10.1121/1.4952387&lt;/a>
  &lt;/li>
  &lt;li>
    Recordings of Australian English speakers were sourced from the following study: 
    Estival, D., Cassidy, S., Cox, F., &amp; Burnham, D. (2014). AusTalk: An audio-visual corpus of Australian English. 
    &lt;i>In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14) (pp. 3105–3109). European Language Resources Association (ELRA).&lt;/i> 
    &lt;a href="http://www.lrec-conf.org/proceedings/lrec2014/pdf/520_Paper.pdf" target="_blank">http://www.lrec-conf.org/proceedings/lrec2014/pdf/520_Paper.pdf&lt;/a>
  &lt;/li>
  &lt;li>
    Recordings of Northern British English speakers were sourced from the following study: 
    Strycharczuk, P., Kirkham, S., Gorman, E., &amp; Nagamine, T. (2025). Dimensionality reduction in lingual articulation of vowels: Evidence from lax vowels in Northern Anglo-English. 
    &lt;i>Language and Speech.&lt;/i> 
    &lt;a href="https://doi.org/10.1177/00238309251320581" target="_blank">https://doi.org/10.1177/00238309251320581&lt;/a>
  &lt;/li>
&lt;/ul></dataSrc></sources></dataColl><anlyInfo/></method><dataAccs><setAvail/><useStmt/><notes type="DVN:TOU" level="dv">&lt;a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0&lt;/a></notes></dataAccs><othrStdyMat><relPubl><citation><titlStmt><titl>Verbeke, G., Escudero, P., Mitterer, H., &amp; Simon, E. (under review). Perceiving and identifying vowels in regional accents of English: Evidence from Dutch- and Spanish-speaking L2 listeners.</titl></titlStmt><biblCit>Verbeke, G., Escudero, P., Mitterer, H., &amp; Simon, E. (under review). Perceiving and identifying vowels in regional accents of English: Evidence from Dutch- and Spanish-speaking L2 listeners.</biblCit></citation></relPubl></othrStdyMat></stdyDscr><otherMat ID="f268084" URI="https://dataverse.no/api/access/datafile/268084" level="datafile"><labl>0_RegionalAccents_README.txt</labl><txt>README file with general and methodological information about the study, as well as an overview of the data and files.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/plain</notes></otherMat><otherMat ID="f256223" URI="https://dataverse.no/api/access/datafile/256223" level="datafile"><labl>RegionalAccents_DPIA.pdf</labl><txt>Short assessment of whether the open publication of this dataset may be said to be in line with applicable legal regulations and research-ethical guidelines.   
</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f254673" URI="https://dataverse.no/api/access/datafile/254673" level="datafile"><labl>RegionalAccents_InformationLetter.pdf</labl><txt>Information letter participants received before giving informed consent. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f254688" URI="https://dataverse.no/api/access/datafile/254688" level="datafile"><labl>RegionalAccents_InformedConsent.pdf</labl><txt>Informed consent sheet. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f254682" URI="https://dataverse.no/api/access/datafile/254682" level="datafile"><labl>RegionalAccents_S.Eng_SentenceReading.pdf</labl><txt>Contains the sentences spoken by the S.Eng (Southern English) speaker during the recording session. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f254666" URI="https://dataverse.no/api/access/datafile/254666" level="datafile"><labl>RegionalAccents_S.Eng_Speakers.csv</labl><txt>Demographic and linguistic background of the S.Eng speakers. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254678" URI="https://dataverse.no/api/access/datafile/254678" level="datafile"><labl>RegionalAccents_AusE.csv</labl><txt>Measurements of Australian English vowels, based on vowel productions in Elvin et al. (2016) and Estival et al. (2014). </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254671" URI="https://dataverse.no/api/access/datafile/254671" level="datafile"><labl>RegionalAccents_Dutch.csv</labl><txt>Measurements of Belgian Dutch vowels, based on vowel productions in Adank et al. (2007). </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254669" URI="https://dataverse.no/api/access/datafile/254669" level="datafile"><labl>RegionalAccents_N.Eng.csv</labl><txt>Measurements of Northern British English vowels, based on vowel productions in Strycharczuk et al. (2025). </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254685" URI="https://dataverse.no/api/access/datafile/254685" level="datafile"><labl>RegionalAccents_S.Eng.csv</labl><txt>Measurements of Southern British English vowels. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254663" URI="https://dataverse.no/api/access/datafile/254663" level="datafile"><labl>RegionalAccents_Spanish.csv</labl><txt>Measurements of European Spanish vowels, based on vowel productions in Chládková et al. (2011). </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268049" URI="https://dataverse.no/api/access/datafile/268049" level="datafile"><labl>RegionalAccents_Stimuli_Acoustics.csv</labl><txt>Acoustic properties of the stimulus materials. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268039" URI="https://dataverse.no/api/access/datafile/268039" level="datafile"><labl>Dutch_AusE.probs.csv</labl><txt>Output of the Dutch-trained LDA model, tested on Australian English vowel data.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268040" URI="https://dataverse.no/api/access/datafile/268040" level="datafile"><labl>Dutch_N.Eng.probs.csv</labl><txt>Output of the Dutch-trained LDA model, tested on Northern British English vowel data.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268041" URI="https://dataverse.no/api/access/datafile/268041" level="datafile"><labl>Dutch_S.Eng.probs.csv</labl><txt>Output of the Dutch-trained LDA model, tested on Southern British English vowel data.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268042" URI="https://dataverse.no/api/access/datafile/268042" level="datafile"><labl>Spanish_AusE.probs.csv</labl><txt>Output of the Spanish-trained LDA model, tested on Australian English vowel data.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268043" URI="https://dataverse.no/api/access/datafile/268043" level="datafile"><labl>Spanish_N.Eng.probs.csv</labl><txt>Output of the Spanish-trained LDA model, tested on Northern British English vowel data.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268044" URI="https://dataverse.no/api/access/datafile/268044" level="datafile"><labl>Spanish_S.Eng.probs.csv</labl><txt>Output of the Spanish-trained LDA model, tested on Southern British English vowel data.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268047" URI="https://dataverse.no/api/access/datafile/268047" level="datafile"><labl>RegionalAccents_Predictions_Dutch.csv</labl><txt>Cross-language acoustic similarity predictions based on the Dutch-trained LDA model.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268048" URI="https://dataverse.no/api/access/datafile/268048" level="datafile"><labl>RegionalAccents_Predictions_Spanish.csv</labl><txt>Cross-language acoustic similarity predictions based on the Spanish-trained LDA model.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254686" URI="https://dataverse.no/api/access/datafile/254686" level="datafile"><labl>RegionalAccents_Categorization_Dutch.csv</labl><txt>Dutch-speaking participants' cross-language and second-language categorization responses. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254672" URI="https://dataverse.no/api/access/datafile/254672" level="datafile"><labl>RegionalAccents_Categorization_Spanish.csv</labl><txt>Spanish-speaking participants' cross- language and second-language categorization responses. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254667" URI="https://dataverse.no/api/access/datafile/254667" level="datafile"><labl>RegionalAccents_Familiarity_Dutch.csv</labl><txt>Dutch-speaking participants' self-reported familiarity with regional accents of English. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254665" URI="https://dataverse.no/api/access/datafile/254665" level="datafile"><labl>RegionalAccents_Familiarity_Spanish.csv</labl><txt>Spanish-speaking participants' self-reported familiarity with regional accents of English. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254681" URI="https://dataverse.no/api/access/datafile/254681" level="datafile"><labl>RegionalAccents_Questionnaire_Dutch.csv</labl><txt>Dutch-speaking participants' responses to the background questionnaire. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f254689" URI="https://dataverse.no/api/access/datafile/254689" level="datafile"><labl>RegionalAccents_Questionnaire_Spanish.csv</labl><txt>Spanish-speaking participants' responses to the background questionnaire. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/comma-separated-values</notes></otherMat><otherMat ID="f268075" URI="https://dataverse.no/api/access/datafile/268075" level="datafile"><labl>RegionalAccents_Cross-Language_Second_Language_Classification.pdf</labl><txt>PDF output of the R Markdown script used to analyze participants' cross-language and second-language vowel categorization responses.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f268074" URI="https://dataverse.no/api/access/datafile/268074" level="datafile"><labl>RegionalAccents_Cross-Language_Second_Language_Classification.Rmd</labl><txt>R Markdown for the analysis of participants' cross-language and second-language vowel categorizations.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/x-r-notebook</notes></otherMat><otherMat ID="f268045" URI="https://dataverse.no/api/access/datafile/268045" level="datafile"><labl>RegionalAccents_Cross-Language_Vowel_Predictions.pdf</labl><txt>PDF output of the R Markdown script used to generate cross-language vowel categorization predictions based on acoustic similarity models.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f268046" URI="https://dataverse.no/api/access/datafile/268046" level="datafile"><labl>RegionalAccents_Cross-Language_Vowel_Predictions.Rmd</labl><txt>R Markdown for predicting cross-language acoustic similarity.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/x-r-notebook</notes></otherMat></codeBook>