View: |
Part 1: Document Description
|
Citation |
|
---|---|
Title: |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
Identification Number: |
doi:10.18710/PTQ8X7 |
Distributor: |
DataverseNO |
Date of Distribution: |
2021-06-16 |
Version: |
1 |
Bibliographic Citation: |
Ancin-Murguzur, Francisco Javier; Hausner, Vera Helene, 2021, "Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature", https://doi.org/10.18710/PTQ8X7, DataverseNO, V1 |
Citation |
|
Title: |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
Identification Number: |
doi:10.18710/PTQ8X7 |
Authoring Entity: |
Ancin-Murguzur, Francisco Javier (UiT The Arctic University of Norway) |
Hausner, Vera Helene (UiT The Arctic University of Norway) |
|
Producer: |
UiT The Arctic University of Norway |
Date of Production: |
2021-05-18 |
Software used in Production: |
R |
Grant Number: |
369903 |
Grant Number: |
296987 |
Distributor: |
DataverseNO |
Distributor: |
UiT The Arctic University of Norway |
Access Authority: |
Ancin-Murguzur, Francisco Javier |
Depositor: |
Ancin Murguzur, Francisco Javier |
Date of Deposit: |
2021-04-30 |
Holdings Information: |
https://doi.org/10.18710/PTQ8X7 |
Study Scope |
|
Keywords: |
Computer and Information Science, Earth and Environmental Sciences, text mining, causal link, algorithm, literature review |
Abstract: |
Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem. |
Date of Collection: |
2020-12-01-2020-12-01 |
Geographic Coverage: |
Tundra ecosystem |
Kind of Data: |
Algorithm in R language to perform bibliographic analyses |
Methodology and Processing |
|
Sources Statement |
|
Data Access |
|
Other Study Description Materials |
|
Related Studies |
|
https://github.com/fjmurguzur/causalizeR |
|
https://doi.org/10.5281/zenodo.4817639 |
|
Related Publications |
|
Citation |
|
Title: |
causalizeR: A text mining algorithm to identify causal relationships in scientific literature - Submitted for review |
Bibliographic Citation: |
causalizeR: A text mining algorithm to identify causal relationships in scientific literature - Submitted for review |
Label: |
00_README.txt |
Notes: |
text/plain |
Label: |
Script.R |
Text: |
Script to process the bibliographic data with the causalizeR package |
Notes: |
type/x-r-syntax |
Label: |
causalizeR.zip |
Text: |
Snapshot of the causalizeR package at the moment of publication |
Notes: |
application/zip |
Label: |
Data.zip |
Text: |
bibliographic files in .bib format as downloaded from Scopus |
Notes: |
application/zip |