{"id":275381,"identifier":"YTSGDM","persistentUrl":"https://doi.org/10.18710/YTSGDM","protocol":"doi","authority":"10.18710","separator":"/","publisher":"DataverseNO","publicationDate":"2026-05-15","storageIdentifier":"S3://10.18710/YTSGDM","datasetType":"dataset","datasetVersion":{"id":5688,"datasetId":275381,"datasetPersistentId":"doi:10.18710/YTSGDM","storageIdentifier":"S3://10.18710/YTSGDM","versionNumber":1,"versionMinorNumber":0,"versionState":"RELEASED","latestVersionPublishingState":"RELEASED","lastUpdateTime":"2026-05-15T06:36:56Z","releaseTime":"2026-05-15T06:36:56Z","createTime":"2026-04-21T11:57:12Z","publicationDate":"2026-05-15","citationDate":"2026-05-15","license":{"name":"CC0 1.0","uri":"http://creativecommons.org/publicdomain/zero/1.0","iconUri":"https://licensebuttons.net/p/zero/1.0/88x31.png","rightsIdentifier":"CC0-1.0","rightsIdentifierScheme":"SPDX","schemeUri":"https://spdx.org/licenses/","languageCode":"en"},"fileAccessRequest":true,"metadataBlocks":{"citation":{"displayName":"Citation Metadata","name":"citation","fields":[{"typeName":"title","multiple":false,"typeClass":"primitive","value":"Supporting data for: LLM-Assisted Keymorph Analysis of Grammatical Case in RT's Israeli–Palestinian Conflict Coverage"},{"typeName":"author","multiple":true,"typeClass":"compound","value":[{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Lu, Tingting"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"https://ror.org/00jdr0662"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"https://orcid.org/0009-0008-0744-9894","expandedvalue":{"personName":"Lu, 
Tingting","@id":"https://orcid.org/0009-0008-0744-9894","scheme":"ORCID","@type":"https://schema.org/Person"}}}]},{"typeName":"datasetContact","multiple":true,"typeClass":"compound","value":[{"datasetContactName":{"typeName":"datasetContactName","multiple":false,"typeClass":"primitive","value":"Lu, Tingting"},"datasetContactAffiliation":{"typeName":"datasetContactAffiliation","multiple":false,"typeClass":"primitive","value":"Beijing Foreign Studies University"},"datasetContactEmail":{"typeName":"datasetContactEmail","multiple":false,"typeClass":"primitive","value":"lutingting09@bfsu.edu.cn"}},{"datasetContactName":{"typeName":"datasetContactName","multiple":false,"typeClass":"primitive","value":"TROLLing curator"},"datasetContactAffiliation":{"typeName":"datasetContactAffiliation","multiple":false,"typeClass":"primitive","value":"TROLLing"},"datasetContactEmail":{"typeName":"datasetContactEmail","multiple":false,"typeClass":"primitive","value":"contact.trolling@uit.no"}}]},{"typeName":"dsDescription","multiple":true,"typeClass":"compound","value":[{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"<b>Dataset description:</b>\n<p>The dataset for this study supports a Keymorph Analysis of grammatical cases in Russian-language news headlines concerning the 2023-2025 Israeli-Palestinian conflict, collected from RT's official news website.</p>\n<p>The dataset comprises four main components:</p>\n<ol>\n<li>Raw Headlines and Filtered Corpus: This component includes the initial collection of Russian-language headlines from RT (2023-10-07 to 2025-01-19) and the subsequently filtered corpus of 8,757 distinct headlines containing specified keywords related to the conflict (e.g., 'Israel', 'Palestine', 'Gaza', 'Hamas').</li>\n<li>Reference Corpus: The reference corpus was constructed from the National Media Subcorpus of the Russian National Corpus (RNC).</li>\n<li>Annotated Corpus of Grammatical Cases: This core 
component features the grammatical case annotations for 11 identified target keywords across the corpus. The annotations were generated using an LLM (ChatGPT-5 mini API) with a 20% human-reviewed and corrected sample integrated into the final dataset to ensure high quality and accuracy.</li>\n<li>Derived Analytical Data and Visualizations: This includes statistical summaries of keyword frequencies and grammatical case distributions, standardized Pearson residual values and log-likelihood (LL) ratio values crucial for keymorph identification, and various visualizations such as word frequency charts and residual heatmaps, all derived from the annotated corpus to support the keymorph analysis.</li>\n</ol>"},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2026-04-21"}},{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"<b>Related article abstract:</b>\n<p>This study applies and extends Keymorph Analysis (KMA) with cognitive linguistic theory to investigate the representation of the Israeli–Palestinian conflict in Russia Today (RT)’s Russian-language headlines. Unlike traditional keyword analysis, which primarily focuses on lexical content, KMA reveals underlying narrative orientations by examining how systematic morphosyntactic choices contribute to the construal of participant roles. 
Our approach integrates three analytical layers: (1) a Quantitative Layer that identifies statistically significant keymorphs using a novel dual-reference framework (Standardized Residuals for internal distinctiveness and Log-likelihood tests against a broad reference corpus) via LLM-enhanced annotation (98.58% accuracy); (2) a Contextual Analysis Layer that maps these grammatical patterns to their specific lexical and semantic environments through corpus-assisted analysis; and (3) a Cognitive-Semantic Interpretation Layer grounded in the cognitive-semantic networks of the Russian case system. Through this integrated analysis, we identify a core-periphery hierarchy in case usage, revealing three contrastive cognitive schemas: military agents vs. humanitarian space, active entities vs. constrained subjects, and external dominance vs. regional passivity. Ultimately, this study provides a scalable, LLM-enhanced methodology for analyzing morphologically rich languages, advancing our understanding of how grammatical case assignment functions as a systematic mechanism for organizing participant positioning and constructing divergent narrative framings.</p>"},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2026-04-27"}}]},{"typeName":"subject","multiple":true,"typeClass":"controlledVocabulary","value":["Arts and Humanities"]},{"typeName":"keyword","multiple":true,"typeClass":"compound","value":[{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Keymorph Analysis"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"grammatical case"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"cognitive linguistics"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"conflict 
discourse"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Large Language Models (LLMs)"}}]},{"typeName":"publication","multiple":true,"typeClass":"compound","value":[{"publicationRelationType":{"typeName":"publicationRelationType","multiple":false,"typeClass":"controlledVocabulary","value":"IsSupplementTo"},"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Lu, T. LLM-Assisted Keymorph Analysis of Grammatical Case in RT’s Israeli-Palestinian Conflict Coverage. <i>Russian Linguistics</i> (accepted)"}}]},{"typeName":"language","multiple":true,"typeClass":"controlledVocabulary","value":["English"]},{"typeName":"producer","multiple":true,"typeClass":"compound","value":[{"producerName":{"typeName":"producerName","multiple":false,"typeClass":"primitive","value":"Beijing Foreign Studies University"},"producerAbbreviation":{"typeName":"producerAbbreviation","multiple":false,"typeClass":"primitive","value":"BFSU"},"producerURL":{"typeName":"producerURL","multiple":false,"typeClass":"primitive","value":"https://en.bfsu.edu.cn"}}]},{"typeName":"distributor","multiple":true,"typeClass":"compound","value":[{"distributorName":{"typeName":"distributorName","multiple":false,"typeClass":"primitive","value":"The Tromsø Repository of Language and Linguistics (TROLLing)"},"distributorAbbreviation":{"typeName":"distributorAbbreviation","multiple":false,"typeClass":"primitive","value":"TROLLing"},"distributorURL":{"typeName":"distributorURL","multiple":false,"typeClass":"primitive","value":"https://trolling.uit.no/"}}]},{"typeName":"depositor","multiple":false,"typeClass":"primitive","value":"Lu, 
Tingting"},{"typeName":"dateOfDeposit","multiple":false,"typeClass":"primitive","value":"2026-04-21"},{"typeName":"timePeriodCovered","multiple":true,"typeClass":"compound","value":[{"timePeriodCoveredStart":{"typeName":"timePeriodCoveredStart","multiple":false,"typeClass":"primitive","value":"2023-10-07"},"timePeriodCoveredEnd":{"typeName":"timePeriodCoveredEnd","multiple":false,"typeClass":"primitive","value":"2025-01-19"}}]},{"typeName":"dateOfCollection","multiple":true,"typeClass":"compound","value":[{"dateOfCollectionStart":{"typeName":"dateOfCollectionStart","multiple":false,"typeClass":"primitive","value":"2023-10-07"},"dateOfCollectionEnd":{"typeName":"dateOfCollectionEnd","multiple":false,"typeClass":"primitive","value":"2025-01-19"}}]},{"typeName":"kindOfData","multiple":true,"typeClass":"primitive","value":["corpus data"]},{"typeName":"dataSources","multiple":true,"typeClass":"primitive","value":["RT (<a href=\"https://russian.rt.com\" title=\"URL\" target=\"_blank\">russian.rt.com</a>). Insubstantial parts of this source are reused in this dataset under exceptions and limitations to intellectual property protection set out in the <a href=\"https://lovdata.no/dokument/NL/lov/2018-06-15-40\" title=\"URL\" target=\"_blank\">Norwegian Copyright Act</a> and <a href=\"https://eur-lex.europa.eu/eli/dir/1996/9\" title=\"URL\" target=\"_blank\">EU Database Directive</a>.","Media Subcorpus of the Russian National Corpus (<a href=\"https://ruscorpora.ru/\" title=\"URL\" target=\"_blank\">ruscorpora.ru</a>). 
Insubstantial parts of this source are reused in this dataset under exceptions and limitations to intellectual property protection set out in the <a href=\"https://lovdata.no/dokument/NL/lov/2018-06-15-40\" title=\"URL\" target=\"_blank\">Norwegian Copyright Act</a> and <a href=\"https://eur-lex.europa.eu/eli/dir/1996/9\" title=\"URL\" target=\"_blank\">EU Database Directive</a>."]}]},"geospatial":{"displayName":"Geospatial Metadata","name":"geospatial","fields":[{"typeName":"geographicCoverage","multiple":true,"typeClass":"compound","value":[{"country":{"typeName":"country","multiple":false,"typeClass":"controlledVocabulary","value":"Russian Federation"}}]}]}},"files":[{"label":"00_ReadMe.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276829,"persistentId":"doi:10.18710/YTSGDM/NXBVBT","pidURL":"https://doi.org/10.18710/YTSGDM/NXBVBT","filename":"00_ReadMe.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":32090,"storageIdentifier":"S3://uit-dataverseno-prod01:19dce7ce327-5898a08c2ec4","rootDataFileId":-1,"md5":"b33b3d8386719be5dc9f82234bbc25d3","checksum":{"type":"MD5","value":"b33b3d8386719be5dc9f82234bbc25d3"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"01_RT_data_headlines_raw.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276736,"persistentId":"doi:10.18710/YTSGDM/O4F8RY","pidURL":"https://doi.org/10.18710/YTSGDM/O4F8RY","filename":"01_RT_data_headlines_raw.txt","contentType":"text/plain","friendlyType":"Plain 
Text","filesize":1157309,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba613d4c-2fde9c84d7ca","rootDataFileId":-1,"md5":"a8444344298753732df998fb159b9ddf","checksum":{"type":"MD5","value":"a8444344298753732df998fb159b9ddf"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"02_RT_top200.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276739,"persistentId":"doi:10.18710/YTSGDM/VLRWTL","pidURL":"https://doi.org/10.18710/YTSGDM/VLRWTL","filename":"02_RT_top200.py","contentType":"text/x-python","friendlyType":"Python Source Code","filesize":5992,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba613f69-58b31ef75b74","rootDataFileId":-1,"md5":"dcb8d6abecb20a7c3eacfa304e160d50","checksum":{"type":"MD5","value":"dcb8d6abecb20a7c3eacfa304e160d50"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"03_RT_top200_words.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276778,"persistentId":"doi:10.18710/YTSGDM/JWNF25","pidURL":"https://doi.org/10.18710/YTSGDM/JWNF25","filename":"03_RT_top200_words.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":25079,"storageIdentifier":"S3://uit-dataverseno-prod01:19dc37c2c8d-b060676b00b1","rootDataFileId":-1,"md5":"7286828499ca9e2e091c37804fe5eb33","checksum":{"type":"MD5","value":"7286828499ca9e2e091c37804fe5eb33"},"tabularData":false,"creationDate":"2026-04-25","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"04_RT_top20_word_frequency_chart.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276740,"persistentId":"doi:10.18710/YTSGDM/AQVEAS","pidURL":"https://doi.org/10.18710/YTSGDM/AQVEAS","filename":"04_RT_top20_word_frequency_chart.py","contentType":"text/x-python","friendlyType":"Python Source 
Code","filesize":881,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba615b55-8b443a017dfe","rootDataFileId":-1,"md5":"419fccbba9541978c6be9b05bda38be8","checksum":{"type":"MD5","value":"419fccbba9541978c6be9b05bda38be8"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"05_RT_top20_word_frequency_chart.png","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276713,"persistentId":"doi:10.18710/YTSGDM/HXNJE4","pidURL":"https://doi.org/10.18710/YTSGDM/HXNJE4","filename":"05_RT_top20_word_frequency_chart.png","contentType":"image/png","friendlyType":"PNG Image","filesize":137221,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba616142-8d0d5c230199","rootDataFileId":-1,"md5":"0534634606c70e5957efcb8d992cd3ba","checksum":{"type":"MD5","value":"0534634606c70e5957efcb8d992cd3ba"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"06_RT_data_with_keywords_raw.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276723,"persistentId":"doi:10.18710/YTSGDM/V1BDG3","pidURL":"https://doi.org/10.18710/YTSGDM/V1BDG3","filename":"06_RT_data_with_keywords_raw.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":1047232,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba616334-2ddcd22647d0","rootDataFileId":-1,"md5":"4b821d0c06c4c191606e46a4537e9f5a","checksum":{"type":"MD5","value":"4b821d0c06c4c191606e46a4537e9f5a"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"07_RT_annotation_AI.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276716,"persistentId":"doi:10.18710/YTSGDM/CYHHUD","pidURL":"https://doi.org/10.18710/YTSGDM/CYHHUD","filename":"07_RT_annotation_AI.py","contentType":"text/x-python","friendlyType":"Python Source 
Code","filesize":9309,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba61a289-a3cb2b71a503","rootDataFileId":-1,"md5":"b40942142a3cbdffa9b074e3018b7dc7","checksum":{"type":"MD5","value":"b40942142a3cbdffa9b074e3018b7dc7"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"08_RT_data_annotated_AI.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276731,"persistentId":"doi:10.18710/YTSGDM/8IV6O4","pidURL":"https://doi.org/10.18710/YTSGDM/8IV6O4","filename":"08_RT_data_annotated_AI.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":1389964,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba61a926-5d5969186d24","rootDataFileId":-1,"md5":"02b2db6f2c9c9655d5857721bf6bdb33","checksum":{"type":"MD5","value":"02b2db6f2c9c9655d5857721bf6bdb33"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"09_RT_annotation_statistics_AI.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276836,"persistentId":"doi:10.18710/YTSGDM/YIIGBA","pidURL":"https://doi.org/10.18710/YTSGDM/YIIGBA","filename":"09_RT_annotation_statistics_AI.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":1809,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcee84cdb-ada244ede08e","rootDataFileId":-1,"md5":"3688c63dedf62a90a9e68073496cd2bb","checksum":{"type":"MD5","value":"3688c63dedf62a90a9e68073496cd2bb"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"10_RT_recall_calculation.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276755,"persistentId":"doi:10.18710/YTSGDM/SKA16O","pidURL":"https://doi.org/10.18710/YTSGDM/SKA16O","filename":"10_RT_recall_calculation.txt","contentType":"text/plain","friendlyType":"Plain 
Text","filesize":639,"storageIdentifier":"S3://uit-dataverseno-prod01:19dbee46e23-381c7fd8a6a7","rootDataFileId":-1,"md5":"daa2d523d6063f43b98d468f39dd5ad6","checksum":{"type":"MD5","value":"daa2d523d6063f43b98d468f39dd5ad6"},"tabularData":false,"creationDate":"2026-04-24","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"11_RT_sample_AI.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276717,"persistentId":"doi:10.18710/YTSGDM/DAJ0ZW","pidURL":"https://doi.org/10.18710/YTSGDM/DAJ0ZW","filename":"11_RT_sample_AI.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":283398,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba629226-a6a78ff466d5","rootDataFileId":-1,"md5":"5cf239c6a88cf3b65849f33aa7f10076","checksum":{"type":"MD5","value":"5cf239c6a88cf3b65849f33aa7f10076"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"12_RT_sample_human.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276709,"persistentId":"doi:10.18710/YTSGDM/RAMGTF","pidURL":"https://doi.org/10.18710/YTSGDM/RAMGTF","filename":"12_RT_sample_human.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":283282,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba62ad09-20225b6e7dba","rootDataFileId":-1,"md5":"6d08eb1669eb5403846558843012c9d9","checksum":{"type":"MD5","value":"6d08eb1669eb5403846558843012c9d9"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"13_RT_discrepancies_between_AI_and_human_annotations.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276801,"persistentId":"doi:10.18710/YTSGDM/MMTICX","pidURL":"https://doi.org/10.18710/YTSGDM/MMTICX","filename":"13_RT_discrepancies_between_AI_and_human_annotations.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":13367,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdad852c-57ca7ebcfd35","rootDataFileId":-1,"md5":"735fdf6a203ccf8d0e47a1e45fd79612","checksum":{"type":"MD5","value":"735fdf6a203ccf8d0e47a1e45fd79612"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"14_RT_data_annotated_reviewed.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276707,"persistentId":"doi:10.18710/YTSGDM/NHB5RZ","pidURL":"https://doi.org/10.18710/YTSGDM/NHB5RZ","filename":"14_RT_data_annotated_reviewed.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":1415234,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba63375e-36c4355264c7","rootDataFileId":-1,"md5":"d904c3bef4f7adb92e56153c874ae39c","checksum":{"type":"MD5","value":"d904c3bef4f7adb92e56153c874ae39c"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"15_RT_annotation_statistics_human.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276800,"persistentId":"doi:10.18710/YTSGDM/948IBZ","pidURL":"https://doi.org/10.18710/YTSGDM/948IBZ","filename":"15_RT_annotation_statistics_human.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":726,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdabfe2b-f401cd8f1482","rootDataFileId":-1,"md5":"58c1e0a9dee06e91d39cd383629ec204","checksum":{"type":"MD5","value":"58c1e0a9dee06e91d39cd383629ec204"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"16_keyword_Izrail_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276805,"persistentId":"doi:10.18710/YTSGDM/Y7LFBG","pidURL":"https://doi.org/10.18710/YTSGDM/Y7LFBG","filename":"16_keyword_Izrail_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2532741,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdaf013f-22c3e65598b7","rootDataFileId":-1,"md5":"fe71c3f417eb8e0561521a5b36c1b043","checksum":{"type":"MD5","value":"fe71c3f417eb8e0561521a5b36c1b043"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"17_keyword_CAXAL_ruscorpora_content_2618.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276807,"persistentId":"doi:10.18710/YTSGDM/6HZ3HN","pidURL":"https://doi.org/10.18710/YTSGDM/6HZ3HN","filename":"17_keyword_CAXAL_ruscorpora_content_2618.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":1650893,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdb1b922-6bd91a9a30b5","rootDataFileId":-1,"md5":"ad6cb9764e274e6fae52e0eace3bfbf1","checksum":{"type":"MD5","value":"ad6cb9764e274e6fae52e0eace3bfbf1"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"18_keyword_Gaza_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276808,"persistentId":"doi:10.18710/YTSGDM/3IFJHQ","pidURL":"https://doi.org/10.18710/YTSGDM/3IFJHQ","filename":"18_keyword_Gaza_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2444077,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdb31f70-e848b63da2e8","rootDataFileId":-1,"md5":"7ae57ee63b5b2d840f1f0cdfc558b345","checksum":{"type":"MD5","value":"7ae57ee63b5b2d840f1f0cdfc558b345"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"19_keyword_XAMAS_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276809,"persistentId":"doi:10.18710/YTSGDM/PCXJTZ","pidURL":"https://doi.org/10.18710/YTSGDM/PCXJTZ","filename":"19_keyword_XAMAS_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":2562803,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdb41cba-b6f32d385364","rootDataFileId":-1,"md5":"9424b4dd0a7df0bcbf2f4b0d9db3e49f","checksum":{"type":"MD5","value":"9424b4dd0a7df0bcbf2f4b0d9db3e49f"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"20_keyword_Palestina_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276810,"persistentId":"doi:10.18710/YTSGDM/Y670WK","pidURL":"https://doi.org/10.18710/YTSGDM/Y670WK","filename":"20_keyword_Palestina_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2611567,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdb52400-631b9b7ded49","rootDataFileId":-1,"md5":"cca43930fc4e9aa6529e7c8bc135d158","checksum":{"type":"MD5","value":"cca43930fc4e9aa6529e7c8bc135d158"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"21_keyword_Livan_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276814,"persistentId":"doi:10.18710/YTSGDM/LWC4LS","pidURL":"https://doi.org/10.18710/YTSGDM/LWC4LS","filename":"21_keyword_Livan_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":2506876,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcde8d3ee-3f82818205cc","rootDataFileId":-1,"md5":"ca1cb197d0007827d4c2b08eecd7afc7","checksum":{"type":"MD5","value":"ca1cb197d0007827d4c2b08eecd7afc7"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"22_keyword_Iran_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276811,"persistentId":"doi:10.18710/YTSGDM/4P9XTY","pidURL":"https://doi.org/10.18710/YTSGDM/4P9XTY","filename":"22_keyword_Iran_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2593764,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdc673ba-b043b57c68f3","rootDataFileId":-1,"md5":"3fa331f7dd24dd3728c01774de2a2dbb","checksum":{"type":"MD5","value":"3fa331f7dd24dd3728c01774de2a2dbb"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"23_keyword_Xezbolla_ruscorpora_content_20 .csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276812,"persistentId":"doi:10.18710/YTSGDM/KXB682","pidURL":"https://doi.org/10.18710/YTSGDM/KXB682","filename":"23_keyword_Xezbolla_ruscorpora_content_20 .csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":12289,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdc85881-5eb830096a6b","rootDataFileId":-1,"md5":"2cc8cd9fe937bc03972dcf6b84311284","checksum":{"type":"MD5","value":"2cc8cd9fe937bc03972dcf6b84311284"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"24_keyword_USA_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276806,"persistentId":"doi:10.18710/YTSGDM/VK3XMR","pidURL":"https://doi.org/10.18710/YTSGDM/VK3XMR","filename":"24_keyword_USA_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2585913,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdb0aaf1-3fb15dd034ef","rootDataFileId":-1,"md5":"d0a3bd24a1aa612d9745cc1b53031bc2","checksum":{"type":"MD5","value":"d0a3bd24a1aa612d9745cc1b53031bc2"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"25_keyword_ООN_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276813,"persistentId":"doi:10.18710/YTSGDM/QHOB4M","pidURL":"https://doi.org/10.18710/YTSGDM/QHOB4M","filename":"25_keyword_ООN_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":2701340,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdd784e9-9fc4cffee8b7","rootDataFileId":-1,"md5":"d73691365c32f40e87bfadac19fee83b","checksum":{"type":"MD5","value":"d73691365c32f40e87bfadac19fee83b"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"26_keyword_Rossija_ruscorpora_content_4000.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276815,"persistentId":"doi:10.18710/YTSGDM/6NSYIK","pidURL":"https://doi.org/10.18710/YTSGDM/6NSYIK","filename":"26_keyword_Rossija_ruscorpora_content_4000.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":2633250,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcde92fec-af5a00e55ae3","rootDataFileId":-1,"md5":"3d2ab51e5d853d8ee02a2741d59ae8d3","checksum":{"type":"MD5","value":"3d2ab51e5d853d8ee02a2741d59ae8d3"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"27_REF_Prompt_AI_annotate.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276725,"persistentId":"doi:10.18710/YTSGDM/NPX0WZ","pidURL":"https://doi.org/10.18710/YTSGDM/NPX0WZ","filename":"27_REF_Prompt_AI_annotate.txt","contentType":"text/plain","friendlyType":"Plain Text","filesize":17262,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba6cb957-d6cccc4ddcbc","rootDataFileId":-1,"md5":"9f79e990d6a044750a000828138d8db5","checksum":{"type":"MD5","value":"9f79e990d6a044750a000828138d8db5"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"28_REF_AI_annotated.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276732,"persistentId":"doi:10.18710/YTSGDM/38Y0R1","pidURL":"https://doi.org/10.18710/YTSGDM/38Y0R1","filename":"28_REF_AI_annotated.txt","contentType":"text/plain","friendlyType":"Plain 
Text","filesize":12415599,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba6d7647-c21d27c376b2","rootDataFileId":-1,"md5":"63e003092ee9593ae37a6cf35520e139","checksum":{"type":"MD5","value":"63e003092ee9593ae37a6cf35520e139"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"29_REF_annotation_statistics_AI.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276837,"persistentId":"doi:10.18710/YTSGDM/SUW7RD","pidURL":"https://doi.org/10.18710/YTSGDM/SUW7RD","filename":"29_REF_annotation_statistics_AI.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":945,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcee8b59d-f5d1d57b2d70","rootDataFileId":-1,"md5":"ac4038a767e328662b0778c2b5c3cad0","checksum":{"type":"MD5","value":"ac4038a767e328662b0778c2b5c3cad0"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"30_REF_samples_AI_annotated.txt","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276727,"persistentId":"doi:10.18710/YTSGDM/RDX209","pidURL":"https://doi.org/10.18710/YTSGDM/RDX209","filename":"30_REF_samples_AI_annotated.txt","contentType":"text/plain","friendlyType":"Plain 
Text","filesize":2503756,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba6fd0b8-bae16cc5354c","rootDataFileId":-1,"md5":"d357282b0767030e7f2b64241fad18ed","checksum":{"type":"MD5","value":"d357282b0767030e7f2b64241fad18ed"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"31_REF_discrepancies_between_AI_and_human_annotations.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276835,"persistentId":"doi:10.18710/YTSGDM/JJ5HJE","pidURL":"https://doi.org/10.18710/YTSGDM/JJ5HJE","filename":"31_REF_discrepancies_between_AI_and_human_annotations.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":85912,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcec3aaf2-06651bb1843d","rootDataFileId":-1,"md5":"a34cf0e45c5ddf904337a0be38f1dcdf","checksum":{"type":"MD5","value":"a34cf0e45c5ddf904337a0be38f1dcdf"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"32_REF_annotation_statistics_human.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276834,"persistentId":"doi:10.18710/YTSGDM/LUADLQ","pidURL":"https://doi.org/10.18710/YTSGDM/LUADLQ","filename":"32_REF_annotation_statistics_human.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":924,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcec360ce-907a0fe5d6fb","rootDataFileId":-1,"md5":"14ea47e7959218fb17c6cb836e433cb6","checksum":{"type":"MD5","value":"14ea47e7959218fb17c6cb836e433cb6"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"33_RT_chi_square_keyword_analysis.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276733,"persistentId":"doi:10.18710/YTSGDM/6BLIN3","pidURL":"https://doi.org/10.18710/YTSGDM/6BLIN3","filename":"33_RT_chi_square_keyword_analysis.py","contentType":"text/x-python","friendlyType":"Python Source Code","filesize":10811,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba712496-b1a859927419","rootDataFileId":-1,"md5":"53d58208abba62d57eefe80422968a05","checksum":{"type":"MD5","value":"53d58208abba62d57eefe80422968a05"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"34_RT_keywords_analysis_report.pdf","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276772,"persistentId":"doi:10.18710/YTSGDM/14ARL6","pidURL":"https://doi.org/10.18710/YTSGDM/14ARL6","filename":"34_RT_keywords_analysis_report.pdf","contentType":"application/pdf","friendlyType":"Adobe 
PDF","filesize":263870,"storageIdentifier":"S3://uit-dataverseno-prod01:19dbeea50ad-11a5e6f5d6fd","rootDataFileId":-1,"md5":"62febafad56c0281077a2a6f6638cc37","checksum":{"type":"MD5","value":"62febafad56c0281077a2a6f6638cc37"},"tabularData":false,"creationDate":"2026-04-24","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"35_RT_keywords_grammatical_case_standardized_residuals_precise.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276773,"persistentId":"doi:10.18710/YTSGDM/4ANCGY","pidURL":"https://doi.org/10.18710/YTSGDM/4ANCGY","filename":"35_RT_keywords_grammatical_case_standardized_residuals_precise.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":1169,"storageIdentifier":"S3://uit-dataverseno-prod01:19dbeeaa2b3-caacc89f88cc","rootDataFileId":-1,"md5":"4c1024dd77d50d1593f828a3ea9d1a60","checksum":{"type":"MD5","value":"4c1024dd77d50d1593f828a3ea9d1a60"},"tabularData":false,"creationDate":"2026-04-24","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"36_RT_grammatical_case_standardized_residuals_heatmap.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":277297,"persistentId":"doi:10.18710/YTSGDM/CJ8RMJ","pidURL":"https://doi.org/10.18710/YTSGDM/CJ8RMJ","filename":"36_RT_grammatical_case_standardized_residuals_heatmap.py","contentType":"text/x-python","friendlyType":"Python Source 
Code","filesize":10351,"storageIdentifier":"S3://uit-dataverseno-prod01:19e2a414c77-b7cdfb10d14e","rootDataFileId":-1,"md5":"f53bd3e49b7dd5af31074bad05bb3082","checksum":{"type":"MD5","value":"f53bd3e49b7dd5af31074bad05bb3082"},"tabularData":false,"creationDate":"2026-05-15","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"37_RT_grammatical_case_standardized_residuals_heatmap.png","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":277298,"persistentId":"doi:10.18710/YTSGDM/LUK1WW","pidURL":"https://doi.org/10.18710/YTSGDM/LUK1WW","filename":"37_RT_grammatical_case_standardized_residuals_heatmap.png","contentType":"image/png","friendlyType":"PNG Image","filesize":876536,"storageIdentifier":"S3://uit-dataverseno-prod01:19e2a41a59e-1c7f065597fe","rootDataFileId":-1,"md5":"9390f6054c2795531695d06c3b082682","checksum":{"type":"MD5","value":"9390f6054c2795531695d06c3b082682"},"tabularData":false,"creationDate":"2026-05-15","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"38_REF_Log-Likelihood.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276708,"persistentId":"doi:10.18710/YTSGDM/5QPNFN","pidURL":"https://doi.org/10.18710/YTSGDM/5QPNFN","filename":"38_REF_Log-Likelihood.py","contentType":"text/x-python","friendlyType":"Python Source Code","filesize":8890,"storageIdentifier":"S3://uit-dataverseno-prod01:19dba71fb71-9e2673736171","rootDataFileId":-1,"md5":"0de1d443a1d01e60df44a99b392d1c0f","checksum":{"type":"MD5","value":"0de1d443a1d01e60df44a99b392d1c0f"},"tabularData":false,"creationDate":"2026-04-23","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"39_REF_Log-Likelihood.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276816,"persistentId":"doi:10.18710/YTSGDM/RIIP2W","pidURL":"https://doi.org/10.18710/YTSGDM/RIIP2W","filename":"39_REF_Log-Likelihood.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated 
Values","filesize":5149,"storageIdentifier":"S3://uit-dataverseno-prod01:19dcdeb60ab-5dd5358b8e9f","rootDataFileId":-1,"md5":"033857ee54f9bf5a82cb30f784f87e9e","checksum":{"type":"MD5","value":"033857ee54f9bf5a82cb30f784f87e9e"},"tabularData":false,"creationDate":"2026-04-27","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"40_Identification_ KMA_LL_SR.csv","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":276793,"persistentId":"doi:10.18710/YTSGDM/DB3GWR","pidURL":"https://doi.org/10.18710/YTSGDM/DB3GWR","filename":"40_Identification_ KMA_LL_SR.csv","contentType":"text/comma-separated-values","friendlyType":"Comma Separated Values","filesize":3316,"storageIdentifier":"S3://uit-dataverseno-prod01:19dc392affd-8cebb68b08ba","rootDataFileId":-1,"md5":"7531620a24376ef0055a54e8ba7dd5a6","checksum":{"type":"MD5","value":"7531620a24376ef0055a54e8ba7dd5a6"},"tabularData":false,"creationDate":"2026-04-25","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"41_Primary_Secondary_Keymorphs_Viz.py","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":277299,"persistentId":"doi:10.18710/YTSGDM/LPKJ5D","pidURL":"https://doi.org/10.18710/YTSGDM/LPKJ5D","filename":"41_Primary_Secondary_Keymorphs_Viz.py","contentType":"text/x-python","friendlyType":"Python Source Code","filesize":9996,"storageIdentifier":"S3://uit-dataverseno-prod01:19e2a422a81-0b43883a2e14","rootDataFileId":-1,"md5":"cb6973fd7db4327f756eb39ea2378384","checksum":{"type":"MD5","value":"cb6973fd7db4327f756eb39ea2378384"},"tabularData":false,"creationDate":"2026-05-15","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"42_Fig_Primary_Keymorphs_Viz.png","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":277301,"persistentId":"doi:10.18710/YTSGDM/CPHARP","pidURL":"https://doi.org/10.18710/YTSGDM/CPHARP","filename":"42_Fig_Primary_Keymorphs_Viz.png","contentType":"image/png","friendlyType":"PNG 
Image","filesize":861886,"storageIdentifier":"S3://uit-dataverseno-prod01:19e2a4300ab-9c0c516ef714","rootDataFileId":-1,"md5":"a8ab3390512b5137b34df032d7ac0510","checksum":{"type":"MD5","value":"a8ab3390512b5137b34df032d7ac0510"},"tabularData":false,"creationDate":"2026-05-15","publicationDate":"2026-05-15","fileAccessRequest":true}},{"label":"43_Fig_Secondary_Keymorphs_Viz.png","restricted":false,"version":1,"datasetVersionId":5688,"dataFile":{"id":277300,"persistentId":"doi:10.18710/YTSGDM/IQXJAZ","pidURL":"https://doi.org/10.18710/YTSGDM/IQXJAZ","filename":"43_Fig_Secondary_Keymorphs_Viz.png","contentType":"image/png","friendlyType":"PNG Image","filesize":445284,"storageIdentifier":"S3://uit-dataverseno-prod01:19e2a42717b-f899b602aa15","rootDataFileId":-1,"md5":"5127e116a50a04ed4a70e8fa41ebfa93","checksum":{"type":"MD5","value":"5127e116a50a04ed4a70e8fa41ebfa93"},"tabularData":false,"creationDate":"2026-05-15","publicationDate":"2026-05-15","fileAccessRequest":true}}],"citation":"Lu, Tingting, 2026, \"Supporting data for: LLM-Assisted Keymorph Analysis of Grammatical Case in RT's Israeli–Palestinian Conflict Coverage\", https://doi.org/10.18710/YTSGDM, DataverseNO, V1"}}