10.18710/PGDWXC
Cvrček, Václav (ORCID: 0000-0003-3977-2393) (Czech National Corpus)
Czech word and MWE lists
DataverseNO
2020
doi:10.18710/PGDWXC/AQYZYI
doi:10.18710/PGDWXC/TVCY3D
doi:10.18710/PGDWXC/UIOVCF
doi:10.18710/PGDWXC/U7CUYJ
doi:10.18710/PGDWXC/0IKRV6
doi:10.18710/PGDWXC/V3AFIK
doi:10.18710/PGDWXC/0BHZIA
doi:10.18710/PGDWXC/OM7Z7G
doi:10.18710/PGDWXC/CPWC3D
doi:10.18710/PGDWXC/ZP0VNT
doi:10.18710/PGDWXC/6LSQ66
doi:10.18710/PGDWXC/QFGZQY
doi:10.18710/PGDWXC/WEWH0J
doi:10.18710/PGDWXC/VLMXD3
doi:10.18710/PGDWXC/4OBJEG
doi:10.18710/PGDWXC/2L0WCX
doi:10.18710/PGDWXC/RJMBJK
doi:10.18710/PGDWXC/KRPYZS
doi:10.18710/PGDWXC/6GFUUT
doi:10.18710/PGDWXC/HBSYAA
doi:10.18710/PGDWXC/HBH8IP
doi:10.18710/PGDWXC/EMPVKB
doi:10.18710/PGDWXC/OIBHQD
doi:10.18710/PGDWXC/6IZENY
doi:10.18710/PGDWXC/SOGZA7
doi:10.18710/PGDWXC/LMBA5O
doi:10.18710/PGDWXC/QRG5HO
doi:10.18710/PGDWXC/RCWQIB
doi:10.18710/PGDWXC/VQB3P5
doi:10.18710/PGDWXC/NHZH8K
doi:10.18710/PGDWXC/LQYPHP
This dataset contains word and MWE (multi-word expression) lists used to operationalize some of the linguistic features in the project on multi-dimensional analysis (MDA) of Czech, carried out at the Czech National Corpus. The MDA procedure requires identifying and operationalizing linguistic features relevant for register variation in the language under scrutiny. In the Czech MDA project, some of these features were operationalized by compiling lists of words and multi-word expressions, which can then be matched against a text to identify occurrences. Compiling such a list can be tedious and error-prone work, which is why we provide ours as a resource for other linguists, either to adopt wholesale or to use as a starting point to build on.
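As an illustration of what matching such a list against a text can look like, here is a minimal Python sketch. It is not the procedure used in the Czech MDA project: the function name `count_list_matches`, the assumption that MWE entries are written as space-separated tokens, the case-insensitive comparison on surface forms, and the sample entries are all hypothetical, and the actual file format of the lists is not described here.

```python
from collections import Counter


def count_list_matches(tokens, entries):
    """Count how often each list entry (single word or MWE) occurs in tokens."""
    # Assumption: matching is case-insensitive and works on surface forms.
    lowered = [t.lower() for t in tokens]
    # Assumption: an MWE entry is a sequence of space-separated tokens.
    patterns = {tuple(e.lower().split()) for e in entries if e.strip()}
    counts = Counter()
    for i in range(len(lowered)):
        for pattern in patterns:
            # Compare the token window starting at position i with the entry.
            if tuple(lowered[i:i + len(pattern)]) == pattern:
                counts[" ".join(pattern)] += 1
    return counts


# Hypothetical sample text and entries, for illustration only.
tokens = "Na rozdíl od toho je to podle mého názoru na rozdíl od jiných jasné".split()
entries = ["na rozdíl od", "podle", "tedy"]
result = count_list_matches(tokens, entries)
```

Overlapping matches are all counted in this sketch. A real pipeline would likely also need proper tokenization and possibly lemmatization before matching.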
Lukeš, David (Czech National Corpus); Czech National Corpus