{"id":86815,"identifier":"ED8HSD","persistentUrl":"https://doi.org/10.18710/ED8HSD","protocol":"doi","authority":"10.18710","publisher":"DataverseNO","publicationDate":"2020-11-30","storageIdentifier":"S3://10.18710/ED8HSD","datasetVersion":{"id":3842,"datasetId":86815,"datasetPersistentId":"doi:10.18710/ED8HSD","storageIdentifier":"S3://10.18710/ED8HSD","versionNumber":2,"versionMinorNumber":2,"versionState":"RELEASED","UNF":"UNF:6:mHk2VYyjhtEzz5muuhFrqw==","lastUpdateTime":"2023-09-28T20:53:58Z","releaseTime":"2023-09-28T20:53:58Z","createTime":"2023-09-28T15:55:25Z","publicationDate":"2020-11-30","citationDate":"2020-11-30","license":{"name":"CC0 1.0","uri":"http://creativecommons.org/publicdomain/zero/1.0","iconUri":"https://licensebuttons.net/p/zero/1.0/88x31.png"},"fileAccessRequest":true,"metadataBlocks":{"citation":{"displayName":"Citation Metadata","name":"citation","fields":[{"typeName":"title","multiple":false,"typeClass":"primitive","value":"Replication Data for: \"ramr: an R package for detection of rare aberrantly methylated regions\""},{"typeName":"author","multiple":true,"typeClass":"compound","value":[{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Nikolaienko, Oleksii"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"University of Bergen"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"0000-0002-5910-4934"}}]},{"typeName":"datasetContact","multiple":true,"typeClass":"compound","value":[{"datasetContactName":{"typeName":"datasetContactName","multiple":false,"typeClass":"primitive","value":"Nikolaienko, Oleksii"},"datasetContactAffiliation":{"typeName":"datasetContactAffiliation","multiple":false,"typeClass":"primitive","value":"University of Bergen"},"datasetContactEmail":{"typeName":"datasetContactEmail","multiple":false,"typeClass":"primitive","value":"oleksii.nikolaienko@uib.no"}}]},{"typeName":"dsDescription","multiple":true,"typeClass":"compound","value":[{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"
This data set contains all the necessary data sets (biologically-relevant\r\nsimulated data sets, preprocessed public data sets) used to evaluate\r\nperformance and obtain results using ramr (https://github.com/BBCG/ramr, http://www.bioconductor.org/packages/ramr/) -\r\na new method for identification of aberrantly methylated regions (AMRs). All\r\nthe necessary R scripts that were used for preparation, testing and analysis\r\nof data sets are also provided. For additional information please check ramr\r\npackage README.md file, vignettes or reference citation.
\r\n\r\nPlease use TREE VIEW to browse files efficiently
"},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2020-11-26"}},{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"Abstract
\r\nWith recent advances in the field of epigenetics, the focus is widening from large and frequent disease- or phenotype-related methylation signatures to rare alterations transmitted mitotically or transgenerationally (constitutional epimutations). Merging evidence indicate that such constitutional alterations, albeit occurring at a low mosaic level, may confer risk of disease later in life. Given their inherently low incidence rate and mosaic nature, there is a need for bioinformatic tools specifically designed to analyse such events.
\r\nWe have developed a method (ramr) to identify aberrantly methylated DNA regions (AMRs). ramr can be applied to methylation data obtained by array or next-generation sequencing techniques to discover AMRs being associated with elevated risk of cancer as well as other diseases. We assessed accuracy and performance metrics of ramr and confirmed its applicability for analysis of large public data sets. Using ramr we identified aberrantly methylated regions that are known or may potentially be associated with development of colorectal cancer and provided functional annotation of AMRs that arise at early developmental stages.
"},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2020-11-26"}}]},{"typeName":"subject","multiple":true,"typeClass":"controlledVocabulary","value":["Computer and Information Science","Medicine, Health and Life Sciences"]},{"typeName":"keyword","multiple":true,"typeClass":"compound","value":[{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"ramr"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"DNA methylation"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Computational Biology"},"keywordVocabulary":{"typeName":"keywordVocabulary","multiple":false,"typeClass":"primitive","value":"MeSH"},"keywordVocabularyURI":{"typeName":"keywordVocabularyURI","multiple":false,"typeClass":"primitive","value":"https://www.ncbi.nlm.nih.gov/mesh/68019295"}},{"keywordValue":{"typeName":"keywordValue","multiple":false,"typeClass":"primitive","value":"Epigenomics"},"keywordVocabulary":{"typeName":"keywordVocabulary","multiple":false,"typeClass":"primitive","value":"MeSH"},"keywordVocabularyURI":{"typeName":"keywordVocabularyURI","multiple":false,"typeClass":"primitive","value":"https://www.ncbi.nlm.nih.gov/mesh/68057890"}}]},{"typeName":"publication","multiple":true,"typeClass":"compound","value":[{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"Oleksii Nikolaienko, Per Eystein Lønning, Stian Knappskog, ramr: an R/Bioconductor package for detection of rare aberrantly methylated regions, Bioinformatics, 2021;, btab586
"},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"doi"},"publicationIDNumber":{"typeName":"publicationIDNumber","multiple":false,"typeClass":"primitive","value":"10.1093/bioinformatics/btab586"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://doi.org/10.1093/bioinformatics/btab586"}},{"publicationCitation":{"typeName":"publicationCitation","multiple":false,"typeClass":"primitive","value":"ramr: an R package for detection of rare aberrantly methylated regions. Oleksii Nikolaienko, Per Eystein Lønning, Stian Knappskog. bioRxiv 2020.12.01.403501
"},"publicationIDType":{"typeName":"publicationIDType","multiple":false,"typeClass":"controlledVocabulary","value":"doi"},"publicationIDNumber":{"typeName":"publicationIDNumber","multiple":false,"typeClass":"primitive","value":"10.1101/2020.12.01.403501"},"publicationURL":{"typeName":"publicationURL","multiple":false,"typeClass":"primitive","value":"https://doi.org/10.1101/2020.12.01.403501"}}]},{"typeName":"language","multiple":true,"typeClass":"controlledVocabulary","value":["English"]},{"typeName":"producer","multiple":true,"typeClass":"compound","value":[{"producerName":{"typeName":"producerName","multiple":false,"typeClass":"primitive","value":"University of Bergen"},"producerAbbreviation":{"typeName":"producerAbbreviation","multiple":false,"typeClass":"primitive","value":"UiB"},"producerURL":{"typeName":"producerURL","multiple":false,"typeClass":"primitive","value":"https://www.uib.no/en"}}]},{"typeName":"distributor","multiple":true,"typeClass":"compound","value":[{"distributorName":{"typeName":"distributorName","multiple":false,"typeClass":"primitive","value":"University of Bergen"},"distributorAffiliation":{"typeName":"distributorAffiliation","multiple":false,"typeClass":"primitive","value":"University of Bergen"},"distributorURL":{"typeName":"distributorURL","multiple":false,"typeClass":"primitive","value":"https://dataverse.no/dataverse/uib"}}]},{"typeName":"depositor","multiple":false,"typeClass":"primitive","value":"Nikolaienko, Oleksii"},{"typeName":"dateOfDeposit","multiple":false,"typeClass":"primitive","value":"2020-11-24"},{"typeName":"kindOfData","multiple":true,"typeClass":"primitive","value":["program source code","machine-readable data","machine-readable text"]},{"typeName":"software","multiple":true,"typeClass":"compound","value":[{"softwareName":{"typeName":"softwareName","multiple":false,"typeClass":"primitive","value":"R"},"softwareVersion":{"typeName":"softwareVersion","multiple":false,"typeClass":"primitive","value":"3.6.3"}},{"softwareName":{"typeName":"softwareName","multiple":false,"typeClass":"primitive","value":"comb-p"},"softwareVersion":{"typeName":"softwareVersion","multiple":false,"typeClass":"primitive","value":"0.50.3"}},{"softwareName":{"typeName":"softwareName","multiple":false,"typeClass":"primitive","value":"ramr"},"softwareVersion":{"typeName":"softwareVersion","multiple":false,"typeClass":"primitive","value":"1.1.2"}}]},{"typeName":"relatedMaterial","multiple":true,"typeClass":"primitive","value":["ramr package code, version 1.1.2"]},{"typeName":"relatedDatasets","multiple":true,"typeClass":"primitive","value":["GSE105018: Whole blood DNA methylation profiles in participants of the Environmental Risk (E-Risk) Longitudinal Twin Study at age 18.","GSE51032: This Series contains data from 845 participants (188 men and 657 women) in the EPIC-Italy cohort that was produced at the Human Genetics Foundation (HuGeF) in Turin, Italy.","GSE98149: Reprogramming of H3K9me3-dependent heterochromatin during mammalian early embryo development [ChIP-seq].","TCGA-COAD: The Cancer Genome Atlas Colon Adenocarcinoma data."]},{"typeName":"otherReferences","multiple":true,"typeClass":"primitive","value":["https://github.com/BBCG/ramr","http://www.bioconductor.org/packages/ramr/"]}]},"biomedical":{"displayName":"Life Sciences Metadata","name":"biomedical","fields":[{"typeName":"studyDesignType","multiple":true,"typeClass":"controlledVocabulary","value":["Not Specified"]},{"typeName":"studyFactorType","multiple":true,"typeClass":"controlledVocabulary","value":["Age","Developmental Stage","Disease State","Immunoprecipitation Antibody","Sex"]},{"typeName":"studyAssayOrganism","multiple":true,"typeClass":"controlledVocabulary","value":["Homo sapiens","Mus musculus"]},{"typeName":"studyAssayMeasurementType","multiple":true,"typeClass":"controlledVocabulary","value":["DNA methylation profiling","Histone Modification (ChIP-Seq)"]},{"typeName":"studyAssayTechnologyType","multiple":true,"typeClass":"controlledVocabulary","value":["DNA microarray"]},{"typeName":"studyAssayPlatform","multiple":true,"typeClass":"controlledVocabulary","value":["Illumina"]}]}},"files":[{"description":"The README file","label":"00-README.txt","restricted":false,"version":1,"datasetVersionId":3842,"dataFile":{"id":101402,"persistentId":"doi:10.18710/ED8HSD/M7MHZB","pidURL":"https://doi.org/10.18710/ED8HSD/M7MHZB","filename":"00-README.txt","contentType":"text/plain","filesize":16317,"description":"The README file","storageIdentifier":"S3://2002-yellow-dataverseno:17a3d96ec54-8923592bb8a5","rootDataFileId":-1,"md5":"eb13e96e7bbfc83eb3e63272dbc11b1d","checksum":{"type":"MD5","value":"eb13e96e7bbfc83eb3e63272dbc11b1d"},"creationDate":"2021-06-24"}},{"description":"An R data file containing GRanges object with all the gene coordinates from GRCh37 annotation.","label":"GRCh37.p13.2019.12.12.Rdata","restricted":false,"directoryLabel":"data/resources","version":1,"datasetVersionId":3842,"dataFile":{"id":86836,"persistentId":"doi:10.18710/ED8HSD/GHXOQF","pidURL":"https://doi.org/10.18710/ED8HSD/GHXOQF","filename":"GRCh37.p13.2019.12.12.Rdata","contentType":"application/gzip","filesize":1521322,"description":"An R data file containing GRanges object with all the gene coordinates from GRCh37 annotation.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9f23544-8e52abdd05fd","rootDataFileId":-1,"md5":"68fee512d0d9804f838056fdbcf3e043","checksum":{"type":"MD5","value":"68fee512d0d9804f838056fdbcf3e043"},"creationDate":"2020-11-24"}},{"description":"An R data file containing GRanges object with all the gene coordinates from GRCh38 annotation.","label":"GRCh38.p13.2020.08.10.Rdata","restricted":false,"directoryLabel":"data/resources","version":1,"datasetVersionId":3842,"dataFile":{"id":86838,"persistentId":"doi:10.18710/ED8HSD/NRMCKB","pidURL":"https://doi.org/10.18710/ED8HSD/NRMCKB","filename":"GRCh38.p13.2020.08.10.Rdata","contentType":"application/gzip","filesize":1651340,"description":"An R data file containing GRanges object with all the gene coordinates from GRCh38 annotation.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9f2361c-3fc286a298cf","rootDataFileId":-1,"md5":"eb0741660c0a67604f5ec66c086ae4d1","checksum":{"type":"MD5","value":"eb0741660c0a67604f5ec66c086ae4d1"},"creationDate":"2020-11-24"}},{"description":"An R data file containing the following objects: 1. geo.ranges: a GRanges object containing beta values for 430802 CpGs across 1658 samples from GSE105018 Illumina HumanMethylation 450 data set (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE105018). 2. sample.info: a list containing sample metadata for all samples in the GSE105018 data set. 3. sample.ids: a character vector of sample IDs.","label":"GSE105018.data.Rdata","restricted":false,"directoryLabel":"data/GSE105018","version":1,"datasetVersionId":3842,"dataFile":{"id":86835,"persistentId":"doi:10.18710/ED8HSD/JYV30Q","pidURL":"https://doi.org/10.18710/ED8HSD/JYV30Q","filename":"GSE105018.data.Rdata","contentType":"application/gzip","filesize":5380541163,"description":"An R data file containing the following objects: 1. geo.ranges: a GRanges object containing beta values for 430802 CpGs across 1658 samples from GSE105018 Illumina HumanMethylation 450 data set (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE105018). 2. sample.info: a list containing sample metadata for all samples in the GSE105018 data set. 3. sample.ids: a character vector of sample IDs.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9e096ad-c0e7ea95a2ce","rootDataFileId":-1,"md5":"fa05d2ac06de7cb282a47b1830319f70","checksum":{"type":"MD5","value":"fa05d2ac06de7cb282a47b1830319f70"},"creationDate":"2020-11-24"}},{"description":"An R data file containing the following objects: 1. geo.ranges: a GRanges object containing beta values for 485512 CpGs across 845 samples from GSE51032 Illumina HumanMethylation 450 data set (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE51032). This data set was used as a template to create simulated test data sets. 2. sample.info: a list containing sample metadata for all samples in the GSE51032 data set. 3. sample.ids: a character vector of sample IDs.","label":"GSE51032.data.Rdata","restricted":false,"directoryLabel":"data/GSE51032","version":1,"datasetVersionId":3842,"dataFile":{"id":86834,"persistentId":"doi:10.18710/ED8HSD/TN0SKC","pidURL":"https://doi.org/10.18710/ED8HSD/TN0SKC","filename":"GSE51032.data.Rdata","contentType":"application/gzip","filesize":3085270135,"description":"An R data file containing the following objects: 1. geo.ranges: a GRanges object containing beta values for 485512 CpGs across 845 samples from GSE51032 Illumina HumanMethylation 450 data set (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE51032). This data set was used as a template to create simulated test data sets. 2. sample.info: a list containing sample metadata for all samples in the GSE51032 data set. 3. sample.ids: a character vector of sample IDs.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9df2b49-607f11d5f07d","rootDataFileId":-1,"md5":"3bca6c45a39974282b261a4e1d69c88a","checksum":{"type":"MD5","value":"3bca6c45a39974282b261a4e1d69c88a"},"creationDate":"2020-11-24"}},{"description":"an R script to reproduce all findings from GSE105018 data set.","label":"PoC.results.GSE105018.R","restricted":false,"directoryLabel":"R","version":1,"datasetVersionId":3842,"dataFile":{"id":86817,"persistentId":"doi:10.18710/ED8HSD/W5EREX","pidURL":"https://doi.org/10.18710/ED8HSD/W5EREX","filename":"PoC.results.GSE105018.R","contentType":"type/x-r-syntax","filesize":17549,"description":"an R script to reproduce all findings from GSE105018 data set.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9c72b73-cd46caa4bf8a","rootDataFileId":-1,"md5":"18ca9c6334dbfdf28ea192ce4c1ac37b","checksum":{"type":"MD5","value":"18ca9c6334dbfdf28ea192ce4c1ac37b"},"creationDate":"2020-11-24"}},{"description":"an R script to reproduce all findings from GSE51032 data set.","label":"PoC.results.GSE51032.R","restricted":false,"directoryLabel":"R","version":1,"datasetVersionId":3842,"dataFile":{"id":86818,"persistentId":"doi:10.18710/ED8HSD/BX8E2A","pidURL":"https://doi.org/10.18710/ED8HSD/BX8E2A","filename":"PoC.results.GSE51032.R","contentType":"type/x-r-syntax","filesize":20954,"description":"an R script to reproduce all findings from GSE51032 data set.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9c729fd-b2c96c58ee53","rootDataFileId":-1,"md5":"e63a711a730716f94e45c8f3590ac427","checksum":{"type":"MD5","value":"e63a711a730716f94e45c8f3590ac427"},"creationDate":"2020-11-24"}},{"description":"an R script that evaluates method performance for ramr and other methods across all test scenarios. NB that some methods (e.g. comb-p) are slow, thus modern hardware and parallelism is advised. As comb-p uses at least four threads by default, it was forced to run in a single threaded mode by limiting \"processes\" parameter of multiprocessing.pool.Pool call to \"1\" in comb-p sorce code (cpv/_common.py).","label":"PoC.results.SIMULATED.R","restricted":false,"directoryLabel":"R","version":1,"datasetVersionId":3842,"dataFile":{"id":86820,"persistentId":"doi:10.18710/ED8HSD/T2X1OF","pidURL":"https://doi.org/10.18710/ED8HSD/T2X1OF","filename":"PoC.results.SIMULATED.R","contentType":"type/x-r-syntax","filesize":20272,"description":"an R script that evaluates method performance for ramr and other methods across all test scenarios. NB that some methods (e.g. comb-p) are slow, thus modern hardware and parallelism is advised. As comb-p uses at least four threads by default, it was forced to run in a single threaded mode by limiting \"processes\" parameter of multiprocessing.pool.Pool call to \"1\" in comb-p sorce code (cpv/_common.py).","storageIdentifier":"S3://2002-yellow-dataverseno:175f9c7259c-f87fd4775884","rootDataFileId":-1,"md5":"8d0ec67128de120205491f21593039c5","checksum":{"type":"MD5","value":"8d0ec67128de120205491f21593039c5"},"creationDate":"2020-11-24"}},{"description":"an R script that finds all aberantly methylated regions in a subset of adjacent tissue samples from TCGA-COAD.","label":"PoC.results.TCGA-COAD.R","restricted":false,"directoryLabel":"R","version":1,"datasetVersionId":3842,"dataFile":{"id":86816,"persistentId":"doi:10.18710/ED8HSD/4GC9KD","pidURL":"https://doi.org/10.18710/ED8HSD/4GC9KD","filename":"PoC.results.TCGA-COAD.R","contentType":"type/x-r-syntax","filesize":4861,"description":"an R script that finds all aberantly methylated regions in a subset of adjacent tissue samples from TCGA-COAD.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9c72888-ff2e094456d6","rootDataFileId":-1,"md5":"3abf3cebe4e06dddd668beb1a86c5536","checksum":{"type":"MD5","value":"3abf3cebe4e06dddd668beb1a86c5536"},"creationDate":"2020-11-24"}},{"description":"an R script that generates simulated test data sets using GSE51032 as a template.","label":"PoC.writer.SIMULATED.R","restricted":false,"directoryLabel":"R","version":1,"datasetVersionId":3842,"dataFile":{"id":86821,"persistentId":"doi:10.18710/ED8HSD/CKQOAK","pidURL":"https://doi.org/10.18710/ED8HSD/CKQOAK","filename":"PoC.writer.SIMULATED.R","contentType":"type/x-r-syntax","filesize":4476,"description":"an R script that generates simulated test data sets using GSE51032 as a template.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9c72717-3168032aaa0c","rootDataFileId":-1,"md5":"dbe46328579edc04f753d3d1407edffa","checksum":{"type":"MD5","value":"dbe46328579edc04f753d3d1407edffa"},"creationDate":"2020-11-24"}},{"description":"an R script that checks LOLA analysis specificity","label":"REVISION-LOLA-bootstrap.R","restricted":false,"directoryLabel":"revision/R","version":1,"datasetVersionId":3842,"dataFile":{"id":101257,"persistentId":"doi:10.18710/ED8HSD/KMKI2C","pidURL":"https://doi.org/10.18710/ED8HSD/KMKI2C","filename":"REVISION-LOLA-bootstrap.R","contentType":"type/x-r-syntax","filesize":1022,"description":"an R script that checks LOLA analysis specificity","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de25ed1-b9117f565c64","rootDataFileId":-1,"md5":"9b83a389cd1acc4c166316bc35275226","checksum":{"type":"MD5","value":"9b83a389cd1acc4c166316bc35275226"},"creationDate":"2021-06-21"}},{"description":"An R data file generated\r\n by revision/R/REVISION-performance-metrics.R and containing the following objects:\r\n 1. simulated.ranges: a GRanges object containing simulated beta values for\r\n 485512 CpGs across 100 samples with aberrant methylation events\r\n introduced in some of the regions.\r\n 2. modified.regions.unique: a GRanges object containing genomic ranges and\r\n sample ID for 1000 true positive unique aberrantly methylated regions.\r\n 3. modified.regions.nonunique: a GRanges object containing genomic ranges\r\n and sample ID for 3000 true positive non-unique aberrantly methylated\r\n regions.\r\n 4. modified.regions.noise: a GRanges object containing genomic ranges\r\n and sample ID for 1000 single-base aberrations.\r\n 5. delta: delta value by which CpG beta values corresponding to particular\r\n region/sample pair in \"simulated.ranges\" were increased or decreased.\r\n 6. merge.window: \"merge.window\" parameter value to use during testing.\r\n 7. min.cpgs: \"min.cpgs\" parameter value to use during testing.\r\n 8. sample.ids: a character vector of sample IDs.","label":"REVISION-SIMULATED-5-1000-0.025.data.Rdata","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101255,"persistentId":"doi:10.18710/ED8HSD/A39C1D","pidURL":"https://doi.org/10.18710/ED8HSD/A39C1D","filename":"REVISION-SIMULATED-5-1000-0.025.data.Rdata","contentType":"application/gzip","filesize":368960794,"description":"An R data file generated\r\n by revision/R/REVISION-performance-metrics.R and containing the following objects:\r\n 1. simulated.ranges: a GRanges object containing simulated beta values for\r\n 485512 CpGs across 100 samples with aberrant methylation events\r\n introduced in some of the regions.\r\n 2. modified.regions.unique: a GRanges object containing genomic ranges and\r\n sample ID for 1000 true positive unique aberrantly methylated regions.\r\n 3. modified.regions.nonunique: a GRanges object containing genomic ranges\r\n and sample ID for 3000 true positive non-unique aberrantly methylated\r\n regions.\r\n 4. modified.regions.noise: a GRanges object containing genomic ranges\r\n and sample ID for 1000 single-base aberrations.\r\n 5. delta: delta value by which CpG beta values corresponding to particular\r\n region/sample pair in \"simulated.ranges\" were increased or decreased.\r\n 6. merge.window: \"merge.window\" parameter value to use during testing.\r\n 7. min.cpgs: \"min.cpgs\" parameter value to use during testing.\r\n 8. sample.ids: a character vector of sample IDs.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de27bd4-8e8bdf396e4d","rootDataFileId":-1,"md5":"391c42062b30ee31d93cd5b7d7b71e76","checksum":{"type":"MD5","value":"391c42062b30ee31d93cd5b7d7b71e76"},"creationDate":"2021-06-21"}},{"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.050\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-5-1000-0.050.data.Rdata","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101264,"persistentId":"doi:10.18710/ED8HSD/OASNJG","pidURL":"https://doi.org/10.18710/ED8HSD/OASNJG","filename":"REVISION-SIMULATED-5-1000-0.050.data.Rdata","contentType":"application/gzip","filesize":368961185,"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.050\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de2a33c-5e6ff24e7073","rootDataFileId":-1,"md5":"caa36d3231cf4de883810174fffd40b0","checksum":{"type":"MD5","value":"caa36d3231cf4de883810174fffd40b0"},"creationDate":"2021-06-21"}},{"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.100\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-5-1000-0.100.data.Rdata","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101268,"persistentId":"doi:10.18710/ED8HSD/ZC3SL8","pidURL":"https://doi.org/10.18710/ED8HSD/ZC3SL8","filename":"REVISION-SIMULATED-5-1000-0.100.data.Rdata","contentType":"application/gzip","filesize":368959741,"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.100\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de2c9f5-d308d137dbdd","rootDataFileId":-1,"md5":"9e947ff65b598d383386507df250b344","checksum":{"type":"MD5","value":"9e947ff65b598d383386507df250b344"},"creationDate":"2021-06-21"}},{"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.250\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-5-1000-0.250.data.Rdata","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101252,"persistentId":"doi:10.18710/ED8HSD/PS5UGE","pidURL":"https://doi.org/10.18710/ED8HSD/PS5UGE","filename":"REVISION-SIMULATED-5-1000-0.250.data.Rdata","contentType":"application/gzip","filesize":368946604,"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.250\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de2ead0-ab08d2ba5231","rootDataFileId":-1,"md5":"ecdcc30e578c9ab0bba128f336c43517","checksum":{"type":"MD5","value":"ecdcc30e578c9ab0bba128f336c43517"},"creationDate":"2021-06-21"}},{"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.500\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-5-1000-0.500.data.Rdata","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101260,"persistentId":"doi:10.18710/ED8HSD/Y6C3RG","pidURL":"https://doi.org/10.18710/ED8HSD/Y6C3RG","filename":"REVISION-SIMULATED-5-1000-0.500.data.Rdata","contentType":"application/gzip","filesize":368926141,"description":"An R data file which is\r\n similar to \"revision∕data/REVISION-SIMULATED-5-1000-0.025.data.Rdata\", but was\r\n generated using \"delta=0.500\". File was generated by\r\n revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de30c36-b00fae62b6f2","rootDataFileId":-1,"md5":"b89032cf0bb2aac9080ab751d9115c96","checksum":{"type":"MD5","value":"b89032cf0bb2aac9080ab751d9115c96"},"creationDate":"2021-06-21"}},{"label":"REVISION-SIMULATED-plots-5-1000.pdf","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101265,"persistentId":"doi:10.18710/ED8HSD/YB4C4J","pidURL":"https://doi.org/10.18710/ED8HSD/YB4C4J","filename":"REVISION-SIMULATED-plots-5-1000.pdf","contentType":"application/pdf","filesize":7896709,"storageIdentifier":"S3://2002-yellow-dataverseno:17a2de31759-5f9d4226deb8","rootDataFileId":-1,"md5":"42fde5a3cf7b0184feee9949b67671f0","checksum":{"type":"MD5","value":"42fde5a3cf7b0184feee9949b67671f0"},"creationDate":"2021-06-21"}},{"description":"An R data file with all the performance\r\n metrics obtained by revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-results-5-1000.Rdata","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101263,"persistentId":"doi:10.18710/ED8HSD/UJXO1U","pidURL":"https://doi.org/10.18710/ED8HSD/UJXO1U","filename":"REVISION-SIMULATED-results-5-1000.Rdata","contentType":"application/gzip","filesize":610837,"description":"An R data file with all the performance\r\n metrics obtained by revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de33ed8-3c57d8289b78","rootDataFileId":-1,"md5":"cb84f3b8c0131c1527f7b2077049e698","checksum":{"type":"MD5","value":"cb84f3b8c0131c1527f7b2077049e698"},"creationDate":"2021-06-21"}},{"description":"detailed performance metrics\r\n obtained by revision/R/REVISION-performance-metrics.R. Decription of columns:\r\n 01. lib: library used\r\n 02. delta: delta beta of aberrant methylation events\r\n 03. cutoff: q-value cutoff parameter for the search\r\n 04. user.self: R system.time() output value\r\n 05. sys.self: R system.time() output value\r\n 06. elapsed: R system.time() output value\r\n 07. user.child: R system.time() output value\r\n 08. sys.child: R system.time() output value\r\n 09. uGTP: number of ground true positive unique AMRs\r\n 10. uTP: number of true positive unique AMRs found\r\n 11. nGTP: number of ground true positive non-unique AMRs\r\n 12. nTP: number of true positive non-unique AMRs found\r\n 13. tFP: total number of false positive AMRs\r\n 14. utpCor: average correlation coefficient for true positive unique AMRs found\r\n 15. ntpCor: average correlation coefficient for true positive non-unique AMRs found\r\n 16. ufnCor: average correlation coefficient for false negative unique AMRs\r\n 17. nfnCor: average correlation coefficient for false negative non-unique AMRs\r\n 18. tGTN: total number of ground true negative AMRs\r\n 19. uFN: number of false negative unique AMRs\r\n 20. nFN: number of false negative non-unique AMRs\r\n 21. tTN: total number of true negative AMRs\r\n 22. tFPR: total false positive AMR rate\r\n 23. uPrecision: precision metric for unique AMRs\r\n 24. uRecall: recall metric for unique AMRs\r\n 25. uMCC: Matthews correlation coefficient metric for unique AMRs\r\n 26. uF1: F1 metric for unique AMRs\r\n 27. uAuROC: area under the recall-tFPR curve for unique AMRs\r\n 28. uAuPR: area under the precision-recall curve for unique AMRs\r\n 29. nPrecision: precision metric for non-unique AMRs\r\n 30. nRecall: recall metric for non-unique AMRs\r\n 31. nMCC: Matthews correlation coefficient metric for non-unique AMRs\r\n 32. nF1: F1 metric for non-unique AMRs\r\n 33. nAuROC: area under the recall-tFPR curve for non-unique AMRs\r\n 34. nAuPR: area under the precision-recall curve for non-unique AMRs","label":"REVISION-SIMULATED-results-5-1000.results.tab","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101253,"persistentId":"doi:10.18710/ED8HSD/X64KU6","pidURL":"https://doi.org/10.18710/ED8HSD/X64KU6","filename":"REVISION-SIMULATED-results-5-1000.results.tab","contentType":"text/tab-separated-values","filesize":165276,"description":"detailed performance metrics\r\n obtained by revision/R/REVISION-performance-metrics.R. Decription of columns:\r\n 01. lib: library used\r\n 02. delta: delta beta of aberrant methylation events\r\n 03. cutoff: q-value cutoff parameter for the search\r\n 04. user.self: R system.time() output value\r\n 05. sys.self: R system.time() output value\r\n 06. elapsed: R system.time() output value\r\n 07. user.child: R system.time() output value\r\n 08. sys.child: R system.time() output value\r\n 09. uGTP: number of ground true positive unique AMRs\r\n 10. uTP: number of true positive unique AMRs found\r\n 11. nGTP: number of ground true positive non-unique AMRs\r\n 12. nTP: number of true positive non-unique AMRs found\r\n 13. tFP: total number of false positive AMRs\r\n 14. utpCor: average correlation coefficient for true positive unique AMRs found\r\n 15. ntpCor: average correlation coefficient for true positive non-unique AMRs found\r\n 16. ufnCor: average correlation coefficient for false negative unique AMRs\r\n 17. nfnCor: average correlation coefficient for false negative non-unique AMRs\r\n 18. tGTN: total number of ground true negative AMRs\r\n 19. uFN: number of false negative unique AMRs\r\n 20. nFN: number of false negative non-unique AMRs\r\n 21. tTN: total number of true negative AMRs\r\n 22. tFPR: total false positive AMR rate\r\n 23. uPrecision: precision metric for unique AMRs\r\n 24. uRecall: recall metric for unique AMRs\r\n 25. uMCC: Matthews correlation coefficient metric for unique AMRs\r\n 26. uF1: F1 metric for unique AMRs\r\n 27. uAuROC: area under the recall-tFPR curve for unique AMRs\r\n 28. uAuPR: area under the precision-recall curve for unique AMRs\r\n 29. nPrecision: precision metric for non-unique AMRs\r\n 30. nRecall: recall metric for non-unique AMRs\r\n 31. nMCC: Matthews correlation coefficient metric for non-unique AMRs\r\n 32. nF1: F1 metric for non-unique AMRs\r\n 33. nAuROC: area under the recall-tFPR curve for non-unique AMRs\r\n 34. nAuPR: area under the precision-recall curve for non-unique AMRs","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de33fbe-a7c1a30f9738","originalFileFormat":"text/tsv","originalFormatLabel":"Tab-Separated Values","originalFileSize":166981,"originalFileName":"REVISION-SIMULATED-results-5-1000.results.tsv","UNF":"UNF:6:v552E9gD4TbKA1rHJX8rRg==","rootDataFileId":-1,"md5":"c624e57ae14c7908e809c06a7b829346","checksum":{"type":"MD5","value":"c624e57ae14c7908e809c06a7b829346"},"creationDate":"2021-06-21"}},{"description":"plots for performance metrics\r\n generated by revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-suppl-plots-5-1000.pdf","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101259,"persistentId":"doi:10.18710/ED8HSD/XB7GAQ","pidURL":"https://doi.org/10.18710/ED8HSD/XB7GAQ","filename":"REVISION-SIMULATED-suppl-plots-5-1000.pdf","contentType":"application/pdf","filesize":102470,"description":"plots for performance metrics\r\n generated by revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de3367a-7411604ea8e1","rootDataFileId":-1,"md5":"913fd543c937654af525e76e27f735db","checksum":{"type":"MD5","value":"913fd543c937654af525e76e27f735db"},"creationDate":"2021-06-21"}},{"description":"top results\r\n generated by revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-suppl-table-5-1000.pdf","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101254,"persistentId":"doi:10.18710/ED8HSD/XGR7SO","pidURL":"https://doi.org/10.18710/ED8HSD/XGR7SO","filename":"REVISION-SIMULATED-suppl-table-5-1000.pdf","contentType":"application/pdf","filesize":10368,"description":"top results\r\n generated by revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de33ad1-fcf5b5b8ba2f","rootDataFileId":-1,"md5":"7fda80aa84392eadab8c6261c9139e8b","checksum":{"type":"MD5","value":"7fda80aa84392eadab8c6261c9139e8b"},"creationDate":"2021-06-21"}},{"description":"time plot\r\n generated by revision/R/REVISION-performance-metrics.R.","label":"REVISION-SIMULATED-suppl-time-5-1000.pdf","restricted":false,"directoryLabel":"revision/data","version":1,"datasetVersionId":3842,"dataFile":{"id":101267,"persistentId":"doi:10.18710/ED8HSD/CFUARQ","pidURL":"https://doi.org/10.18710/ED8HSD/CFUARQ","filename":"REVISION-SIMULATED-suppl-time-5-1000.pdf","contentType":"application/pdf","filesize":5298,"description":"time plot\r\n generated by revision/R/REVISION-performance-metrics.R.","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de340b8-cd35346430d4","rootDataFileId":-1,"md5":"71f55b4ef65cec167d8bb8255723c42d","checksum":{"type":"MD5","value":"71f55b4ef65cec167d8bb8255723c42d"},"creationDate":"2021-06-21"}},{"description":"an R script that builds ramr flowchart","label":"REVISION-algorithm-diagram.R","restricted":false,"directoryLabel":"revision/R","version":1,"datasetVersionId":3842,"dataFile":{"id":101262,"persistentId":"doi:10.18710/ED8HSD/O4NSIN","pidURL":"https://doi.org/10.18710/ED8HSD/O4NSIN","filename":"REVISION-algorithm-diagram.R","contentType":"type/x-r-syntax","filesize":1603,"description":"an R script that builds ramr flowchart","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de25ff9-a4a33c147d18","rootDataFileId":-1,"md5":"56e9be779fc1dc036e52646b9cbc219a","checksum":{"type":"MD5","value":"56e9be779fc1dc036e52646b9cbc219a"},"creationDate":"2021-06-21"}},{"description":"an R script that compares template and simulated data sets","label":"REVISION-dataset-similarity.R","restricted":false,"directoryLabel":"revision/R","version":1,"datasetVersionId":3842,"dataFile":{"id":101266,"persistentId":"doi:10.18710/ED8HSD/OHUKFT","pidURL":"https://doi.org/10.18710/ED8HSD/OHUKFT","filename":"REVISION-dataset-similarity.R","contentType":"type/x-r-syntax","filesize":1433,"description":"an R script that compares template and simulated data sets","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de25ad9-032382feb483","rootDataFileId":-1,"md5":"36c3d2c86d25a62443a45ee633c4eb41","checksum":{"type":"MD5","value":"36c3d2c86d25a62443a45ee633c4eb41"},"creationDate":"2021-06-21"}},{"description":"an R script that generates simulated test data sets using GSE51032 as a template and evaluates method performance for ramr and other methods across all test scenarios. NB that some methods (e.g. comb-p) are slow, thus modern hardware and parallelism is advised. As comb-p uses at least four threads by default, it was forced to run in a single threaded mode by limiting \"processes\" parameter of multiprocessing.pool.Pool call to \"1\" in comb-p sorce code (cpv/_common.py).","label":"REVISION-performance-metrics.R","restricted":false,"directoryLabel":"revision/R","version":1,"datasetVersionId":3842,"dataFile":{"id":101261,"persistentId":"doi:10.18710/ED8HSD/J7NL8C","pidURL":"https://doi.org/10.18710/ED8HSD/J7NL8C","filename":"REVISION-performance-metrics.R","contentType":"type/x-r-syntax","filesize":31391,"description":"an R script that generates simulated test data sets using GSE51032 as a template and evaluates method performance for ramr and other methods across all test scenarios. NB that some methods (e.g. comb-p) are slow, thus modern hardware and parallelism is advised. As comb-p uses at least four threads by default, it was forced to run in a single threaded mode by limiting \"processes\" parameter of multiprocessing.pool.Pool call to \"1\" in comb-p sorce code (cpv/_common.py).","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de25c27-7d4e73403c0d","rootDataFileId":-1,"md5":"ea44e02b075e6dfbcd7d8616b242e4d3","checksum":{"type":"MD5","value":"ea44e02b075e6dfbcd7d8616b242e4d3"},"creationDate":"2021-06-21"}},{"description":"an R script that compares ramr and DMRcate across various random seeds","label":"REVISION-ramr-dmrcate.R","restricted":false,"directoryLabel":"revision/R","version":1,"datasetVersionId":3842,"dataFile":{"id":101258,"persistentId":"doi:10.18710/ED8HSD/ID3ZOQ","pidURL":"https://doi.org/10.18710/ED8HSD/ID3ZOQ","filename":"REVISION-ramr-dmrcate.R","contentType":"type/x-r-syntax","filesize":9339,"description":"an R script that compares ramr and DMRcate across various random seeds","storageIdentifier":"S3://2002-yellow-dataverseno:17a2de25d6e-cec67e5af349","rootDataFileId":-1,"md5":"6c28284c51c9eb41ebdd56de4eefc9db","checksum":{"type":"MD5","value":"6c28284c51c9eb41ebdd56de4eefc9db"},"creationDate":"2021-06-21"}},{"description":"An R data file containing the following objects: 1. simulated.ranges: a GRanges object containing simulated beta values for 485512 CpGs across 100 samples with aberrant methylation events introduced in some of the regions. 2. modified.regions.unique: a GRanges object containing genomic ranges and sample ID for 1000 true positive unique aberrantly methylated regions. 3. modified.regions.nonunique: a GRanges object containing genomic ranges and sample ID for 3000 true positive non-unique aberrantly methylated regions. 4. delta: delta value by which CpG beta values corresponding to particular region/sample pair in \"simulated.ranges\" were increased or decreased. 5. merge.window: \"merge.window\" parameter value to use during testing. 6. min.cpgs: \"min.cpgs\" parameter value to use during testing. 7. sample.ids: a character vector of sample IDs.","label":"SIMULATED-5-1000-0.025.data.Rdata","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86826,"persistentId":"doi:10.18710/ED8HSD/QVDNUK","pidURL":"https://doi.org/10.18710/ED8HSD/QVDNUK","filename":"SIMULATED-5-1000-0.025.data.Rdata","contentType":"application/gzip","filesize":368946490,"description":"An R data file containing the following objects: 1. simulated.ranges: a GRanges object containing simulated beta values for 485512 CpGs across 100 samples with aberrant methylation events introduced in some of the regions. 2. modified.regions.unique: a GRanges object containing genomic ranges and sample ID for 1000 true positive unique aberrantly methylated regions. 3. modified.regions.nonunique: a GRanges object containing genomic ranges and sample ID for 3000 true positive non-unique aberrantly methylated regions. 4. delta: delta value by which CpG beta values corresponding to particular region/sample pair in \"simulated.ranges\" were increased or decreased. 5. merge.window: \"merge.window\" parameter value to use during testing. 6. min.cpgs: \"min.cpgs\" parameter value to use during testing. 7. sample.ids: a character vector of sample IDs.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d870a5-7a9d845a9081","rootDataFileId":-1,"md5":"ee3bd504f36ffe850383a5d488bb6a95","checksum":{"type":"MD5","value":"ee3bd504f36ffe850383a5d488bb6a95"},"creationDate":"2020-11-24"}},{"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.050\"","label":"SIMULATED-5-1000-0.050.data.Rdata","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86832,"persistentId":"doi:10.18710/ED8HSD/VGNS57","pidURL":"https://doi.org/10.18710/ED8HSD/VGNS57","filename":"SIMULATED-5-1000-0.050.data.Rdata","contentType":"application/gzip","filesize":368946539,"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.050\"","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d889e1-b776ae29eff3","rootDataFileId":-1,"md5":"7f2d709785f68949cad639b577545039","checksum":{"type":"MD5","value":"7f2d709785f68949cad639b577545039"},"creationDate":"2020-11-24"}},{"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.100\"","label":"SIMULATED-5-1000-0.100.data.Rdata","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86833,"persistentId":"doi:10.18710/ED8HSD/JZSZWE","pidURL":"https://doi.org/10.18710/ED8HSD/JZSZWE","filename":"SIMULATED-5-1000-0.100.data.Rdata","contentType":"application/gzip","filesize":368947353,"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.100\"","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d8a4e8-e0d84072c3a9","rootDataFileId":-1,"md5":"bdfff1e851824f8481e6e85ef616fa18","checksum":{"type":"MD5","value":"bdfff1e851824f8481e6e85ef616fa18"},"creationDate":"2020-11-24"}},{"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.250\"","label":"SIMULATED-5-1000-0.250.data.Rdata","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86829,"persistentId":"doi:10.18710/ED8HSD/HHFGXU","pidURL":"https://doi.org/10.18710/ED8HSD/HHFGXU","filename":"SIMULATED-5-1000-0.250.data.Rdata","contentType":"application/gzip","filesize":368935068,"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.250\"","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d8c943-dfcfad711660","rootDataFileId":-1,"md5":"491f986c04f2e3f0404e94502a2cae67","checksum":{"type":"MD5","value":"491f986c04f2e3f0404e94502a2cae67"},"creationDate":"2020-11-24"}},{"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.500\"","label":"SIMULATED-5-1000-0.500.data.Rdata","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86828,"persistentId":"doi:10.18710/ED8HSD/0ZEUYI","pidURL":"https://doi.org/10.18710/ED8HSD/0ZEUYI","filename":"SIMULATED-5-1000-0.500.data.Rdata","contentType":"application/gzip","filesize":368916081,"description":"An R data file which is similar to \"data∕SIMULATED/SIMULATED-5-1000-0.025.data.Rdata\", but was generated using \"delta=0.500\"","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d8e28d-71f00ff234c6","rootDataFileId":-1,"md5":"9c77b6926d9f5471894b8e6e290583b4","checksum":{"type":"MD5","value":"9c77b6926d9f5471894b8e6e290583b4"},"creationDate":"2020-11-24"}},{"description":"An R data file containing the following objects: 1. tcga.data: an object containing beta values for 487192 CpGs across 192 samples from TCGA-COAD Illumina HumanMethylation 450 data set (https://portal.gdc.cancer.gov/). This data set was used to find epimutations undergoing positive selection during carcinogenesys.","label":"TCGA-COAD.tcga.data.Rdata","restricted":false,"directoryLabel":"data/TCGA-COAD","version":1,"datasetVersionId":3842,"dataFile":{"id":86824,"persistentId":"doi:10.18710/ED8HSD/2QT3OB","pidURL":"https://doi.org/10.18710/ED8HSD/2QT3OB","filename":"TCGA-COAD.tcga.data.Rdata","contentType":"application/gzip","filesize":379662007,"description":"An R data file containing the following objects: 1. tcga.data: an object containing beta values for 487192 CpGs across 192 samples from TCGA-COAD Illumina HumanMethylation 450 data set (https://portal.gdc.cancer.gov/). This data set was used to find epimutations undergoing positive selection during carcinogenesys.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d3128f-65825674b925","rootDataFileId":-1,"md5":"b87f97f9113c17a700c08ed3b9e3f6ac","checksum":{"type":"MD5","value":"b87f97f9113c17a700c08ed3b9e3f6ac"},"creationDate":"2020-11-24"}},{"description":"JSON file containing information on all Illumina Human Methylation 450 assay results in TCGA.","label":"files.2019-10-10.json","restricted":false,"directoryLabel":"data/TCGA-COAD","version":1,"datasetVersionId":3842,"dataFile":{"id":86822,"persistentId":"doi:10.18710/ED8HSD/PN0JQI","pidURL":"https://doi.org/10.18710/ED8HSD/PN0JQI","filename":"files.2019-10-10.json","contentType":"application/json","filesize":4455462,"description":"JSON file containing information on all Illumina Human Methylation 450 assay results in TCGA.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d31ae9-9afcb2670cab","rootDataFileId":-1,"md5":"6755701032361ea047148d99d06ee374","checksum":{"type":"MD5","value":"6755701032361ea047148d99d06ee374"},"creationDate":"2020-11-24"}},{"description":"human hg19 to mouse mm9 liftover chain. Required for R/PoC.results.GSE105018.R.","label":"hg19ToMm9.over.chain","restricted":false,"directoryLabel":"data/resources/liftOver","version":1,"datasetVersionId":3842,"dataFile":{"id":86840,"persistentId":"doi:10.18710/ED8HSD/E6IYWL","pidURL":"https://doi.org/10.18710/ED8HSD/E6IYWL","filename":"hg19ToMm9.over.chain","contentType":"application/octet-stream","filesize":252045507,"description":"human hg19 to mouse mm9 liftover chain. Required for R/PoC.results.GSE105018.R.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9f18758-519514d278fb","rootDataFileId":-1,"md5":"f2663fa9723977cc8b7ce5383bc4ab1f","checksum":{"type":"MD5","value":"f2663fa9723977cc8b7ce5383bc4ab1f"},"creationDate":"2020-11-24"}},{"description":"LOLA database containing information on mouse embryonic chromatin marks. Required for R/PoC.results.GSE105018.R. Ungzip before using.","label":"mouse.embryonic.marks.tar.gz","restricted":false,"directoryLabel":"data/resources/LOLA/db","version":1,"datasetVersionId":3842,"dataFile":{"id":86837,"persistentId":"doi:10.18710/ED8HSD/IQMCIN","pidURL":"https://doi.org/10.18710/ED8HSD/IQMCIN","filename":"mouse.embryonic.marks.tar.gz","contentType":"application/gzip","filesize":1881350089,"description":"LOLA database containing information on mouse embryonic chromatin marks. Required for R/PoC.results.GSE105018.R. Ungzip before using.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9f208ab-3a25136ee6cb","rootDataFileId":-1,"md5":"1915579eadd2cc76a9c2de09273a3279","checksum":{"type":"MD5","value":"1915579eadd2cc76a9c2de09273a3279"},"creationDate":"2020-11-24"}},{"description":"An R data file with all the performance metrics obtained by R/PoC.results.SIMULATED.R.","label":"results-5-1000.Rdata","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86825,"persistentId":"doi:10.18710/ED8HSD/XHCTWT","pidURL":"https://doi.org/10.18710/ED8HSD/XHCTWT","filename":"results-5-1000.Rdata","contentType":"application/gzip","filesize":73769,"description":"An R data file with all the performance metrics obtained by R/PoC.results.SIMULATED.R.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d856b2-371618949869","rootDataFileId":-1,"md5":"1bae7f83d47c2121aa626923d02a77cf","checksum":{"type":"MD5","value":"1bae7f83d47c2121aa626923d02a77cf"},"creationDate":"2020-11-24"}},{"description":"detailed performance metrics obtained by R/PoC.results.SIMULATED.R.","label":"results-5-1000.results.tab","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86831,"persistentId":"doi:10.18710/ED8HSD/6UWG69","pidURL":"https://doi.org/10.18710/ED8HSD/6UWG69","filename":"results-5-1000.results.tab","contentType":"text/tab-separated-values","filesize":67873,"description":"detailed performance metrics obtained by R/PoC.results.SIMULATED.R.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d85759-c285f259f6ac","originalFileFormat":"text/tsv","originalFormatLabel":"Tab-Separated Values","originalFileSize":69289,"originalFileName":"results-5-1000.results.tsv","UNF":"UNF:6:Jb8LIc18+DBR9OX2xWFcXQ==","rootDataFileId":-1,"md5":"902d2c58ce7ae7c80cc9d74167827953","checksum":{"type":"MD5","value":"902d2c58ce7ae7c80cc9d74167827953"},"creationDate":"2020-11-24"}},{"description":"plots for performance metrics obtained by R/PoC.results.SIMULATED.R.","label":"results-plots-5-1000.pdf","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86830,"persistentId":"doi:10.18710/ED8HSD/ZLD0KQ","pidURL":"https://doi.org/10.18710/ED8HSD/ZLD0KQ","filename":"results-plots-5-1000.pdf","contentType":"application/pdf","filesize":50898,"description":"plots for performance metrics obtained by R/PoC.results.SIMULATED.R.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d857eb-bf490299444a","rootDataFileId":-1,"md5":"ec67c898bd356af2f61ef35562a027f1","checksum":{"type":"MD5","value":"ec67c898bd356af2f61ef35562a027f1"},"creationDate":"2020-11-24"}},{"description":"An R data file containing the following objects: 1. tcga.coad.ranges: GRanges object with TCGA-COAD subset of methylation beta values for adjacent and tumour tissue pairs. 2. tcga.coad.ramr.hg19: GRanges object with aberrantly methylated regions found in adjacent TCGA-COAD tissue samples.","label":"tcga.coad.AMRs.Rdata","restricted":false,"directoryLabel":"data/TCGA-COAD","version":1,"datasetVersionId":3842,"dataFile":{"id":86823,"persistentId":"doi:10.18710/ED8HSD/B8WZTI","pidURL":"https://doi.org/10.18710/ED8HSD/B8WZTI","filename":"tcga.coad.AMRs.Rdata","contentType":"application/gzip","filesize":197710002,"description":"An R data file containing the following objects: 1. tcga.coad.ranges: GRanges object with TCGA-COAD subset of methylation beta values for adjacent and tumour tissue pairs. 2. tcga.coad.ramr.hg19: GRanges object with aberrantly methylated regions found in adjacent TCGA-COAD tissue samples.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d324e6-e80f962ad60d","rootDataFileId":-1,"md5":"b07cf6d556a8cb446028d58fca583ca8","checksum":{"type":"MD5","value":"b07cf6d556a8cb446028d58fca583ca8"},"creationDate":"2020-11-24"}},{"description":"An R data file containing character vector with all polymorphic and non-specific Illumina HumanMethylation 450 probe IDs.","label":"to.remove.RData","restricted":false,"directoryLabel":"data/resources","version":1,"datasetVersionId":3842,"dataFile":{"id":86839,"persistentId":"doi:10.18710/ED8HSD/B3O7HR","pidURL":"https://doi.org/10.18710/ED8HSD/B3O7HR","filename":"to.remove.RData","contentType":"application/gzip","filesize":409933,"description":"An R data file containing character vector with all polymorphic and non-specific Illumina HumanMethylation 450 probe IDs.","storageIdentifier":"S3://2002-yellow-dataverseno:175f9f236d5-7ba46795b995","rootDataFileId":-1,"md5":"4611f9e92657d846cc2ea84eea518ea8","checksum":{"type":"MD5","value":"4611f9e92657d846cc2ea84eea518ea8"},"creationDate":"2020-11-24"}},{"description":"Methylation profiles of eight random genomic regions from the GSE51032 template data set (“GSE51032”) together with the methylation profiles of the same regions in each of five simulated test data sets obtained by changing beta values within those regions for a subset of samples (highlighted) by a particular delta (“delta=0.025”, “0.050”, “0.100”, “0.250” or “0.500”).","label":"writer-plots-5-1000.pdf","restricted":false,"directoryLabel":"data/SIMULATED","version":1,"datasetVersionId":3842,"dataFile":{"id":86827,"persistentId":"doi:10.18710/ED8HSD/CSTQJU","pidURL":"https://doi.org/10.18710/ED8HSD/CSTQJU","filename":"writer-plots-5-1000.pdf","contentType":"application/pdf","filesize":7046951,"description":"Methylation profiles of eight random genomic regions from the GSE51032 template data set (“GSE51032”) together with the methylation profiles of the same regions in each of five simulated test data sets obtained by changing beta values within those regions for a subset of samples (highlighted) by a particular delta (“delta=0.025”, “0.050”, “0.100”, “0.250” or “0.500”).","storageIdentifier":"S3://2002-yellow-dataverseno:175f9d8ea61-fbc9e30b2d9f","rootDataFileId":-1,"md5":"e3cf65636cc64f6ce7d3a42452a31320","checksum":{"type":"MD5","value":"e3cf65636cc64f6ce7d3a42452a31320"},"creationDate":"2020-11-24"}}],"citation":"Nikolaienko, Oleksii, 2020, \"Replication Data for: \"ramr: an R package for detection of rare aberrantly methylated regions\"\", https://doi.org/10.18710/ED8HSD, DataverseNO, V2, UNF:6:mHk2VYyjhtEzz5muuhFrqw== [fileUNF]"}}