Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature (doi:10.18710/PTQ8X7)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature

Identification Number:

doi:10.18710/PTQ8X7

Distributor:

DataverseNO

Date of Distribution:

2021-06-16

Version:

1

Bibliographic Citation:

Ancin-Murguzur, Francisco Javier; Hausner, Vera Helene, 2021, "Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature", https://doi.org/10.18710/PTQ8X7, DataverseNO, V1

Study Description

Citation

Title:

Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature

Identification Number:

doi:10.18710/PTQ8X7

Authoring Entity:

Ancin-Murguzur, Francisco Javier (UiT The Arctic University of Norway)

Hausner, Vera Helene (UiT The Arctic University of Norway)

Producer:

UiT The Arctic University of Norway

Date of Production:

2021-05-18

Software used in Production:

R

Grant Number:

369903

Grant Number:

296987

Distributor:

DataverseNO

Distributor:

UiT The Arctic University of Norway

Access Authority:

Ancin-Murguzur, Francisco Javier

Depositor:

Ancin Murguzur, Francisco Javier

Date of Deposit:

2021-04-30

Holdings Information:

https://doi.org/10.18710/PTQ8X7

Study Scope

Keywords:

Computer and Information Science, Earth and Environmental Sciences, text mining, causal link, algorithm, literature review

Abstract:

Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem.

Date of Collection:

2020-12-01-2020-12-01

Geographic Coverage:

Tundra ecosystem

Kind of Data:

Algorithm in R language to perform bibliographic analyses

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Studies

https://github.com/fjmurguzur/causalizeR

https://doi.org/10.5281/zenodo.4817639

Related Publications

Citation

Title:

causalizeR: A text mining algorithm to identify causal relationships in scientific literature - Submitted for review

Bibliographic Citation:

causalizeR: A text mining algorithm to identify causal relationships in scientific literature - Submitted for review

Other Study-Related Materials

Label:

00_README.txt

Notes:

text/plain

Other Study-Related Materials

Label:

Script.R

Text:

Script to process the bibliographic data with the causalizeR package

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

causalizeR.zip

Text:

Snapshot of the causalizeR package at the moment of publication

Notes:

application/zip

Other Study-Related Materials

Label:

Data.zip

Text:

bibliographic files in .bib format as downloaded from Scopus

Notes:

application/zip