Replication Data for: A long birth: The development of gender-specific paucal constructions in Russian (doi:10.18710/54ZJGQ)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description

Citation

Title:

Replication Data for: A long birth: The development of gender-specific paucal constructions in Russian

Identification Number:

doi:10.18710/54ZJGQ

Distributor:

DataverseNO

Date of Distribution:

2020-01-10

Version:

1

Bibliographic Citation:

Nesset, Tore, 2020, "Replication Data for: A long birth: The development of gender-specific paucal constructions in Russian", https://doi.org/10.18710/54ZJGQ, DataverseNO, V1, UNF:6:gPJ6II6hFdOw6z+tdDSMnw== [fileUNF]

Study Description

Citation

Title:

Replication Data for: A long birth: The development of gender-specific paucal constructions in Russian

Identification Number:

doi:10.18710/54ZJGQ

Authoring Entity:

Nesset, Tore (UiT The Arctic University of Norway)

Producer:

UiT The Arctic University of Norway

Distributor:

DataverseNO

Distributor:

The Tromsø Repository of Language and Linguistics (TROLLing)

Access Authority:

Nesset, Tore

Depositor:

Nesset, Tore

Date of Deposit:

2020-01-09

Holdings Information:

https://doi.org/10.18710/54ZJGQ

Study Scope

Keywords:

Arts and Humanities, Russian, numeral, paucal, S-curve, language change

Abstract:

The databases and scripts for statistical analysis included in this TROLLing post concern the so-called paucal construction in Russian where a numeral (dva ‘two’, tri ‘three’, chetyre ‘four’) is followed by an adjective and a noun. There are two versions of the database, one with examples in Cyrillic and one without. The version without examples can be used for statistical analysis, since some statistical software has problems with Cyrillic.

Article abstract: This article investigates the diachronic development of Russian numeral constructions consisting of a paucal numeral (dva ‘two’, tri ‘three’, chetyre ‘four’) followed by an adjective and a noun. Based on statistical analysis of more than 6,000 corpus examples, it is shown that a split took place in the second half of the twentieth century when feminine nouns developed a different agreement pattern from that of masculine and neuter nouns. This split is argued to represent the final step in a long “birth process” of gender-specific paucal constructions that started with the loss of the dual in the Middle Ages. It is suggested that we are witnessing a cascading effect, whereby the feminine pattern develops when the pattern for masculine and neuter nouns are approaching stabilization. The article furthermore includes a discussion of the hypothesis that “S-curves” represent a template for language change. While the documented changes resemble S-curves, the proposed analysis also addresses some general problems with testing the S-curve hypothesis empirically.

Time Period:

1825-2012

Date of Collection:

2015-2015

Country:

Russian Federation

Kind of Data:

corpus data

Methodology and Processing

Sources Statement

Data Sources:

Russian National Corpus: www.ruscorpora.ru

Data Access

Other Study Description Materials

Related Publications

Citation

Title:

Tore Nesset: A long birth: The development of gender-specific paucal constructions in Russian, Diachronica, volume 36

Identification Number:

10.1075/dia

Bibliographic Citation:

Tore Nesset: A long birth: The development of gender-specific paucal constructions in Russian, Diachronica, volume 36

File Description--f11162

File: DatabaseNumeralsRNC2018forCARTsmall.tab

  • Number of cases: 6475

  • No. of variables per record: 11

  • Type of File: text/tab-separated-values

Notes:

UNF:6:hRUuU0/uhKZryIthb/qh7w==

File Description--f11163

File: DatabaseNumeralsRNC2018forCARTsmallWithExamples.tab

  • Number of cases: 6474

  • No. of variables per record: 15

  • Type of File: text/tab-separated-values

Notes:

UNF:6:AiTcj6Ji6Vr9qpWYvER4fg==

Variable Description

List of Variables:

Variables

Numeral

f11162 Location:

Variable Format: character

Notes: UNF:6:C+A+sAA5ihLqqZrX0toG1Q==

Gender

f11162 Location:

Variable Format: character

Notes: UNF:6:w6ls1ywc2qKLCdGt7JySlg==

Case

f11162 Location:

Variable Format: character

Notes: UNF:6:vWnv9nAzrSg9/ZI+xbyFtA==

Period

f11162 Location:

Variable Format: character

Notes: UNF:6:87uciAHgNNVdV+GoV7Z6Kg==

Predicate

f11162 Location:

Variable Format: character

Notes: UNF:6:DrujOPTnt/wWt73PHRf0Rw==

Modifier

f11162 Location:

Variable Format: character

Notes: UNF:6:f9UCMZ5mQjRIWzQEbrqSng==

Preposition

f11162 Location:

Variable Format: character

Notes: UNF:6:ucmsXoIFS3ORXLtP7Odj2A==

NumeralCase

f11162 Location:

Variable Format: character

Notes: UNF:6:rC652QOqtU2oRAU+yY6tYQ==

ComplexNumeral

f11162 Location:

Variable Format: character

Notes: UNF:6:Stcm1agZN4PnlkKm3wpqzA==

ConjoinedSubject

f11162 Location:

Variable Format: character

Notes: UNF:6:7EEeigZqLcvSpJY4Ub0QgQ==

NounStress

f11162 Location:

Variable Format: character

Notes: UNF:6:nZPoZcscYIpachumDIeAiw==

Numeral

f11163 Location:

Variable Format: character

Notes: UNF:6:eAMvOICuY0XVxSQUCPXZjA==

Gender

f11163 Location:

Variable Format: character

Notes: UNF:6:9zyOiIgTl+phBolseFnmXQ==

Case

f11163 Location:

Variable Format: character

Notes: UNF:6:sAF4au0Tw8WeyeUROtcJxg==

Period

f11163 Location:

Variable Format: character

Notes: UNF:6:IAnXUs/aWr858SjrN/C3OA==

Predicate

f11163 Location:

Variable Format: character

Notes: UNF:6:UWO+AKAVBX0pryNgpeJvbQ==

Modifier

f11163 Location:

Variable Format: character

Notes: UNF:6:fGl0W2AbdAjBdrryO88c3A==

Preposition

f11163 Location:

Variable Format: character

Notes: UNF:6:BRxg6twDk5qpoBu7TPiVQA==

NumeralCase

f11163 Location:

Variable Format: character

Notes: UNF:6:UjyfIhQzFy2obmzVRNR5mw==

ComplexNumeral

f11163 Location:

Variable Format: character

Notes: UNF:6:+ReQx9uL5mhEnwRwn69Axg==

ConjoinedSubject

f11163 Location:

Variable Format: character

Notes: UNF:6:OnrKJnGtSG3F5Uggv+le9A==

NounStress

f11163 Location:

Variable Format: character

Notes: UNF:6:svV/nB6jzLv1lOz5DJZ4mg==

Leftcontext

f11163 Location:

Variable Format: character

Notes: UNF:6:qrBsU8/z2gqe3qsLYZLB1w==

Center

f11163 Location:

Variable Format: character

Notes: UNF:6:hM+/1ZpaQ2tNG3CVgQyg6A==

Punct

f11163 Location:

Variable Format: character

Notes: UNF:6:FktW8ztGaCS4C/0342LXCw==

Rightcontext

f11163 Location:

Variable Format: character

Notes: UNF:6:MT3uU/5esChzo+uQFPSmuw==

Other Study-Related Materials

Label:

DatabaseNumeralsRNC2018forCARTsmall.txt

Text:

This is the database without examples in Cyrillic.

Notes:

text/plain

Other Study-Related Materials

Label:

DatabaseNumeralsRNC2018forCARTsmallWithExamples.txt

Text:

This is the database with examples in Cyrillic.

Notes:

text/plain

Other Study-Related Materials

Label:

Numerals2018RCodeRNCdata.R

Text:

This file contains code for statistical analysis (software: R).

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

READMEDatabaseNumeralsRNC2018forCARTSmall.txt

Text:

This file contains documentation and explanation for the remaining files in the TROLLing post.

Notes:

text/plain