3,161 to 3,170 of 4,726 Results
Oct 30, 2018
Cvrček, Václav, 2018, "Multi-Dimensional Analysis of Czech", https://doi.org/10.18710/QAJKZW, DataverseNO, V1, UNF:6:5rqhrfGF8iJspOAQER3OCA== [fileUNF]
Original data for a general-purpose multi-dimensional analysis model of register variation in Czech. This post contains a CSV data set of 137 linguistic features measured on 3428 Czech text chunks, and an R script which performs a factor analysis on this data set. The results of... |
Oct 30, 2018 -
Multi-Dimensional Analysis of Czech
MS Word - 48.1 KB -
MD5: 4b01482c643e34dc80b32e200a8c74e3
Start here. |
Oct 30, 2018 -
Multi-Dimensional Analysis of Czech
Adobe PDF - 612.3 KB -
MD5: 0ac6cbd2495d939cc50087a6334c526e
Start here. |
Oct 30, 2018 -
Multi-Dimensional Analysis of Czech
Tabular Data - 7.1 MB - 143 Variables, 3428 Observations - UNF:6:5rqhrfGF8iJspOAQER3OCA==
Values of linguistic features in individual text chunks. Each row of the table corresponds to a text chunk in the Koditex corpus. Columns represent linguistic features, and additionally text chunk ID, classification metadata (MODE, DIVISION, SUPERCLASS, CLASS) and length (_LEN).... |
Oct 30, 2018 -
Multi-Dimensional Analysis of Czech
R Syntax - 1007 B -
MD5: 4f670b765505156dc3b9cfd53b7d397d
R code for performing factor analysis on the supplied data set. |
Oct 8, 2018
Eckhoff, Hanne, 2018, "Replication Data for: A corpus approach to the history of Russian po delimitatives", https://doi.org/10.18710/PUXWXL, DataverseNO, V1, UNF:6:cBhFlWa1cdor4P8imgjjRQ== [fileUNF]
This paper gives an example of how enriched diachronic treebank data can shed new light on an old and conflicted topic, even when that topic is morphological and semantic in nature rather than syntactic. The topic is the rise of the Russian po delimitatives, a change seen as cruc... |
Plain Text - 30.2 KB -
MD5: 538383e1b651ca77fdc40fc9e72404bd
Overview and detailed description of all files. |
R Syntax - 8.7 KB -
MD5: 46081fe72080ecce0aa7b64ce5a73565
R script analysing the Old Church Slavonic data |
R Syntax - 15.1 KB -
MD5: 5fb407544d41314ca902b3374e1498f1
R script analysing the Old East Slavic and Middle Russian data |
Comma Separated Values - 14.8 MB -
MD5: 4d312861d0da0f500e80d11d1666a2ff
OCS main dataset. See 00_readme.txt for further details |