View: |
Part 1: Document Description
|
Citation |
|
---|---|
Title: |
Non-Standard Allomorphy in Russian Prefixes: Corpus, Experimental, and Statistical Exploration |
Identification Number: |
doi:10.18710/FKJAJR |
Distributor: |
DataverseNO |
Date of Distribution: |
2014-08-18 |
Version: |
1 |
Bibliographic Citation: |
Endresen, Anna, 2014, "Non-Standard Allomorphy in Russian Prefixes: Corpus, Experimental, and Statistical Exploration", https://doi.org/10.18710/FKJAJR, DataverseNO, V1 |
Citation |
|
Title: |
Non-Standard Allomorphy in Russian Prefixes: Corpus, Experimental, and Statistical Exploration |
Identification Number: |
doi:10.18710/FKJAJR |
Authoring Entity: |
Endresen, Anna (UiT The Arctic University of Norway) |
Producer: |
UiT The Arctic University of Norway |
Date of Production: |
2014 |
Distributor: |
DataverseNO |
Distributor: |
The Tromsø Repository of Language and Linguistics (TROLLing) |
Access Authority: |
Endresen, Anna |
Date of Deposit: |
2014-08-15 |
Holdings Information: |
https://doi.org/10.18710/FKJAJR |
Study Scope |
|
Keywords: |
Arts and Humanities, prefixes, Russian, verbs, allomorphy |
Topic Classification: |
Field: Morphology, Field: Semantics, Time-depth: synchronic, Topic: affixes |
Abstract: |
Abstract: This dissertation challenges the traditional idealized model of allomorphy by confronting it with comprehensive data on 15 Russian aspectual prefixes (RAZ-, RAS-, RAZO-, S-, SO-, PERE-, PRE-, VZ-, VOZ-, O-, OB-, OBO-, U-, VY-, IZ-) collected from corpus and linguistic experiments. The traditional definition narrows allomorphy down to a mere variation of form where the meaning remains constant and variants are distributed complementarily. My findings show that submorphemic semantic differences and distributional overlap are not uncommon properties of morpheme variants. I suggest that allomorphy is a broader phenomenon that goes beyond the axioms of complementary distribution and identical meaning. I examine non-trivial cases of prefix polysemy and multifactorial conditioning of prefix distribution that make it difficult to assess the traditional criteria for allomorphy. Moreover, I present studies of semantic dissimilation of allomorphs and overlap in distribution that violate the absolute criteria for allomorphic relationship. I take the perspective of Cognitive Linguistics and propose an alternative usage-based model of allomorphy that is flexible enough to capture both standard exemplars and non-standard deviations. This model offers detailed applications of several advanced statistical models that optimize the criteria of both semantic “sameness” and distributional complementarity. According to this model, allomorphy is a scalar relationship between morpheme variants – a relationship that can vary in terms of closeness and regularity. Statistical modeling turns the concept of allomorphy into a measurable and verifiable correspondence of form-meaning variation. This makes it possible to measure semantic simi larity and divergence and distinguish robust patterns of distribution from random effects. |
The set of files includes tagged databases, their versions used in statistical analyses and R codes for the statistical analyses described in the dissertation. |
|
Geographic Coverage: |
Russia, Russia |
Kind of Data: |
corpus |
Methodology and Processing |
|
Sources Statement |
|
Data Access |
|
Other Study Description Materials |
|
Related Publications |
|
Citation |
|
Title: |
Endresen, Anna. Non-Standard Allomorphy in Russian Prefixes: Corpus, Experimental, and Statistical Exploration. PhD Thesis. UiT The Arctic University of Norway. https://hdl.handle.net/10037/7098. |
Identification Number: |
10037/7098 |
Bibliographic Citation: |
Endresen, Anna. Non-Standard Allomorphy in Russian Prefixes: Corpus, Experimental, and Statistical Exploration. PhD Thesis. UiT The Arctic University of Norway. https://hdl.handle.net/10037/7098. |
Label: |
ALL PREFIXES CORPUS FACTITIVE VERBS.xlsx |
Text: |
A corpus-based collection of factitive verbs formed by different prefixes (O-, U-, ZA-, RAZ-, etc.) and from different types of bases (adjectival, nominal, pronominal, adverbial, etc.) |
Notes: |
application/octet-stream |
Label: |
DataOU.csv |
Text: |
This is a reduced version of the database for the purposes of the statistical analysis of the corpus data (155 verbs in O- and U-). |
Notes: |
text/plain; charset=US-ASCII |
Label: |
datOB.csv |
Text: |
This is the database of subjects responses for the Linear Regression Mixed Effects Model |
Notes: |
text/plain; charset=US-ASCII |
Label: |
O OB DATABASE CORPUS.xlsx |
Text: |
This database contains 1037 verbs in O-, OB-, and OBO- collected from the Russian National Corpus. |
Notes: |
application/octet-stream |
Label: |
O U CORPUS DATA.xlsx |
Text: |
This database contains 155 factitive (change-of-state) verbs in O- and U- collected from the Russian National Corpus. |
Notes: |
application/octet-stream |
Label: |
O U EXPERIMENTAL DATA.xlsx |
Text: |
Acceptability scores elicited from 121 subjects in the experiment on Russian factitive verbs in O- and U-. |
Notes: |
application/octet-stream |
Label: |
ObCorpus.csv |
Text: |
This is the database for the statistical analysis of the corpus data on O-, OB-, and OBO-. It excludes deetymologized verbs. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
ObExperimentRandomForestData.csv |
Text: |
This is the database of subjects responses for the Classification Tree & Random Forests model |
Notes: |
text/plain; charset=US-ASCII |
Label: |
ObExperimentSubjects.csv |
Text: |
This spreadsheet contains anonymous sociolinguistic information about the subjects who participated in the experiment. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
PERE PRE DATABASE.xlsx |
Text: |
This database contains 945 verbs prefixed in PERE- and PRE- collected from the Russian National Corpus. |
Notes: |
application/octet-stream |
Label: |
R script O OB CORPUS.R |
Text: |
This is the code for the statistical analysis of the corpus data on the prefixes O-, OB-, and OBO-. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script O OB EXPERIMENT.R |
Text: |
This is the R code for both statistical models for the experimental data on the prefixes O-, OB-, and OBO-. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script O U CORPUS DATA.R |
Text: |
R code for the statistical analysis of 155 factitive verbs in O- and U- |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script O U EXPERIMENT ALL MODELS.R |
Text: |
This is the R code for all statistical models applied to the experimental data on O- and U- in factitive verbs. |
Notes: |
text/plain; charset=UTF-8 |
Label: |
R script PERE PRE.R |
Text: |
This is the R code for several statistical tests discussed in Chapter 6. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script RAZ.R |
Text: |
This is the R code for the statistical analysis. The statistical analysis models the distribution of these polysemous but standard allomorphs and evaluates the relative impact of each factor. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script S SO.R |
Text: |
This is the R code for the statistical analysis discussed in Chapter 4. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script VY IZ.R |
Text: |
This R code documents the statistical tests discussed in Chapter 8. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
R script VZ VOZ.R |
Text: |
This R code documents the statistical test discussed in Chapter 7. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
RAZ DATABASE.xlsx |
Text: |
This database contains 210 perfective Russian verbs formed by the prefixes RAZ-, RAS-, and RAZO- 'apart'. These prefixes represent a case of Standard Allomorphy conditioned by phonological and morphophonological factors. Two phenomena are at work here: voicing assimilation across a prefix-root boundary (prefixes RAZ- ~ RAS- 'apart') and vocalization of consonant-final Russian prefixes (RAZ- ~ RAZO- 'apart'). |
Notes: |
application/octet-stream |
Label: |
RAZ_RAS.csv |
Text: |
This is a csv version of the database designed for the purposes of the statistical analysis documented in the R code. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
RAZ_RAS_RAZO.csv |
Text: |
This is a csv version of the database designed for the purposes of the statistical analysis documented in the R code. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
S SO MODERN RUSSIAN.xlsx |
Text: |
This database contains 998 Modern Russian verbs in S- and SO- collected from the Russian National Corpus. Each verb is assigned a number of tags and is illustrated with examples from the corpus. |
Notes: |
application/octet-stream |
Label: |
ScoresForStatistics.csv |
Text: |
This is the database for the statistical analysis. |
Notes: |
text/plain; charset=US-ASCII |
Label: |
VY IZ DATABASE.xlsx |
Text: |
This database contains 989 verbs in VY- and IZ- attested in the Russian National Corpus. |
Notes: |
application/octet-stream |
Label: |
VZ VOZ DATABASE.xls |
Text: |
This database contains 384 verbs in VZ- and VOZ- collected from the Russian National Corpus. |
Notes: |
application/vnd.ms-excel |