Title: Czech word and MWE lists
Identifier: doi:10.18710/PGDWXC
Distributor: DataverseNO
Distribution date: 2020-04-09
Version: 1
Citation: Cvrček, Václav, 2020, "Czech word and MWE lists", https://doi.org/10.18710/PGDWXC, DataverseNO, V1

Authors: Cvrček, Václav; Komrsková, Zuzana; Lukeš, David; Poukarová, Petra; Řehořková, Anna; Zasina, Adrian Jan
Producer: Czech National Corpus, Prague
Grant number: CZ.02.1.01/0.0/0.0/16_013/0001758
Repository: DataverseNO, The Tromsø Repository of Language and Linguistics (TROLLing)
Contact: Lukeš, David
Depositor: Lukeš, David
Deposit date: 2020-04-06
Subject: Arts and Humanities
Keywords: multi-dimensional analysis; lexicon; Czech; word list

Description:
This post contains word and MWE (multi-word expression) lists used for the operationalization of some of the linguistic features in the multi-dimensional analysis (MDA) of Czech project carried out at the Czech National Corpus. The MDA procedure requires identifying and operationalizing linguistic features relevant for register variation in the language under scrutiny. In the Czech MDA project, some of these features were operationalized by compiling lists of words and multi-word expressions, which can then be matched against a text to identify occurrences. Compiling such a list can be tedious and error-prone work, which is why we provide ours as a resource for other linguists either to adopt wholesale or at least use as a starting point to build on top of.

Time period and collection dates: 1990, 2014, 2017, 2018
Country: Czech Republic
Kind of data: corpus data
Data source: Koditex corpus (https://wiki.korpus.cz/doku.php/en:cnk:koditex)
Related material: https://doi.org/10.18710/QAJKZW

Related publications:
Cvrček, V., Komrsková, Z., Lukeš, D. et al. Comparing web-crawled and traditional corpora. Lang Resources & Evaluation (2020). doi:10.1007/s10579-020-09487-4
Cvrček, V., Komrsková, Z., Lukeš, D., Poukarová, P., Řehořková, A., & Zasina, A. (2018). From extra- to intratextual characteristics: Charting the space of variation in Czech through MDA, Corpus Linguistics and Linguistic Theory (published online ahead of print). doi:10.1515/cllt-2018-0020

Files:
00_README.docx (Start here.) application/vnd.openxmlformats-officedocument.wordprocessingml.document
00_README.pdf (Start here.) application/pdf
ASIM.txt text/plain
AUG.txt text/plain
COH2.txt text/plain
DEMI.txt text/plain
DP.txt text/plain
DT.txt text/plain
EXP.txt text/plain
FEI.txt text/plain
FOUU.txt text/plain
FUOU.txt text/plain
FYEJ.txt text/plain
GENL.txt text/plain
GRAAA.txt text/plain
KONT.txt text/plain
LAT.txt text/plain
MOD.txt text/plain
PAJ.txt text/plain
PAV.txt text/plain
POE.txt text/plain
PRE2.txt text/plain
PROPA.txt text/plain
PROPT.txt text/plain
PSB.txt text/plain
PVB.txt text/plain
RST.txt text/plain
VD.txt text/plain
VTS.txt text/plain
VUL.txt text/plain
VYPW.txt text/plain
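
The description in this record says the lists "can then be matched against a text to identify occurrences". The following is a minimal Python sketch of what such matching could look like, not the project's actual procedure. It assumes each .txt file holds one word or multi-word expression per line (the record does not state the file format; 00_README is the authority on that) and that matching is done on lowercased surface tokens, whereas the original project may well match lemmas or tagged forms. The function names and the sample expressions are hypothetical and chosen only for illustration.

```python
import re
from pathlib import Path


def load_expressions(path):
    """Read one expression per line (an assumed format), skipping blank lines.

    Each expression becomes a tuple of lowercased tokens, so a single word
    is a 1-tuple and an MWE is a longer tuple.
    """
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [tuple(line.lower().split()) for line in lines if line.strip()]


def tokenize(text):
    """Naive tokenizer: lowercased runs of word characters (Unicode-aware)."""
    return re.findall(r"\w+", text.lower())


def count_matches(tokens, expressions):
    """Count how often each expression occurs as a contiguous token sequence."""
    counts = {}
    for expr in expressions:
        n = len(expr)
        hits = sum(
            1
            for i in range(len(tokens) - n + 1)
            if tuple(tokens[i:i + n]) == expr
        )
        counts[" ".join(expr)] = hits
    return counts


# Illustrative expressions only; they are not claimed to come from the lists.
sample_expressions = [("na", "rozdíl", "od"), ("jiné",)]
sample_tokens = tokenize("Na rozdíl od toho je to na rozdíl od všeho jiné.")
result = count_matches(sample_tokens, sample_expressions)
print(result)
```

With a real list, `load_expressions("MOD.txt")` would replace the hand-written sample, provided the one-expression-per-line assumption holds for that file. The per-expression scan is quadratic in the worst case; for large corpora, indexing expressions by their first token would be a natural refinement.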