Package cregulome. September 13, 2018

Similar documents
Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Package fastdummies. January 8, 2018

Package dbx. July 5, 2018

Package condusco. November 8, 2017

Package gggenes. R topics documented: November 7, Title Draw Gene Arrow Maps in 'ggplot2' Version 0.3.2

Package calpassapi. August 25, 2018

Package vinereg. August 10, 2018

Package validara. October 19, 2017

Package qualmap. R topics documented: September 12, Type Package

Package postgistools

Package SEMrushR. November 3, 2018

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3.

Package data.world. April 5, 2018

Package gifti. February 1, 2018

Package ggimage. R topics documented: December 5, Title Use Image in 'ggplot2' Version 0.1.0

Package ggimage. R topics documented: November 1, Title Use Image in 'ggplot2' Version 0.0.7

Package widyr. August 14, 2017

Package gtrendsr. October 19, 2017

Package bisect. April 16, 2018

Package ECctmc. May 1, 2018

Package canvasxpress

Package WordR. September 7, 2017

Package internetarchive

Package fitbitscraper

Package barcoder. October 26, 2018

Package docxtools. July 6, 2018

Package GetITRData. October 22, 2017

Package deductive. June 2, 2017

Package datasets.load

Package messaging. May 27, 2018

Package bikedata. April 27, 2018

Package facerec. May 14, 2018

Package kdtools. April 26, 2018

Package robotstxt. November 12, 2017

Package gtrendsr. August 4, 2018

Package NFP. November 21, 2016

Package effectr. January 17, 2018

Package splithalf. March 17, 2018

Package rsppfp. November 20, 2018

Package preprosim. July 26, 2016

Package enrichplot. September 29, 2018

Package balance. October 12, 2018

Package fastqcr. April 11, 2017

Package opencage. January 16, 2018

Package goodpractice

Package RODBCext. July 31, 2017

Package knitrprogressbar

Package darksky. September 20, 2017

Package bigreadr. R topics documented: August 13, Version Date Title Read Large Text Files

Package projector. February 27, 2018

Package wrswor. R topics documented: February 2, Type Package

Package VSE. March 21, 2016

Package statsdk. September 30, 2017

Package spark. July 21, 2017

Package githubinstall

Package vip. June 15, 2018

Package rprojroot. January 3, Title Finding Files in Project Subdirectories Version 1.3-2

Package snplist. December 11, 2017

Package ezknitr. September 16, 2016

Package pcr. November 20, 2017

Package TrafficBDE. March 1, 2018

Package rzeit2. January 7, 2019

Package omu. August 2, 2018

Package meme. December 6, 2017

Package queuecomputer

Package kirby21.base

Package sigqc. June 13, 2018

Package essurvey. August 23, 2018

Package virustotal. May 1, 2017

Package geneslope. October 26, 2016

Package meme. November 2, 2017

Package semver. January 6, 2017

Package tidytransit. March 4, 2019

Package wikitaxa. December 21, 2017

Package repec. August 31, 2018

Package dkanr. July 12, 2018

Package rgho. R topics documented: January 18, 2017

Package reval. May 26, 2015

Package benchmarkme. R topics documented: February 26, Type Package Title Crowd Sourced System Benchmarks Version 0.2.

Package librarian. R topics documented:

Package geogrid. August 19, 2018

Package sfc. August 29, 2016

Package pwrab. R topics documented: June 6, Type Package Title Power Analysis for AB Testing Version 0.1.0

Package guardianapi. February 3, 2019

Package RPresto. July 13, 2017

Package regexselect. R topics documented: September 22, Version Date Title Regular Expressions in 'shiny' Select Lists

Package fingertipsr. May 25, Type Package Version Title Fingertips Data for Public Health

Package quickreg. R topics documented:

Package skynet. December 12, 2018

Package diffusr. May 17, 2018

Package clipr. June 23, 2018

Package WhiteStripe. April 19, 2018

Package available. November 17, 2017

Package stapler. November 27, 2017

Package weco. May 4, 2018

Package rwars. January 14, 2017

Package Organism.dplyr

Package mdftracks. February 6, 2017

Package jdx. R topics documented: January 9, Type Package Title 'Java' Data Exchange for 'R' and 'rjava'

Package readxl. April 18, 2017

Transcription:

Type Package Version 0.2.0 Package cregulome September 13, 2018 Title Obtain and Visualize Regulome-Gene Expression Correlations in Cancer Builds a 'SQLite' database file of pre-calculated transcription factor/microrna-gene correlations (co-expression) in cancer from the Cistrome Cancer Liu et al. (2011) <doi:10.1186/gb-2011-12-8-r83> and 'mircancerdb' databases (in press). Provides custom classes and functions to query, tidy and plot the correlation data. License GPL-3 URL https://github.com/ropensci/cregulome BugReports https://github.com/ropensci/cregulome/issues Encoding UTF-8 LazyData true RoxygenNote 6.0.1 Depends R(>= 2.10.0) Imports DBI, graphics, httr, RSQLite, UpSetR, VennDiagram, ggplot2, ggridges, grid, R.utils, igraph VignetteBuilder knitr Suggests knitr, rmarkdown, testthat, covr, readxl, AnnotationDbi, org.hs.eg.db, clusterprofiler NeedsCompilation no Author Mahmoud Ahmed [aut, cre] Maintainer Mahmoud Ahmed <mahmoud.s.fahmy@students.kasralainy.edu.eg> Repository CRAN Date/Publication 2018-09-13 11:50:03 UTC 1

2 cmicrorna R topics documented: cmicrorna......................................... 2 cor_hist........................................... 3 cor_igraph.......................................... 4 cor_joy........................................... 5 cor_plot........................................... 6 cor_prep........................................... 7 cor_tidy........................................... 7 cor_upset.......................................... 8 cor_venn_diagram..................................... 9 cregulome......................................... 10 ctf............................................. 11 get_db............................................ 11 get_mir........................................... 12 get_tf............................................ 14 stat_collect......................................... 15 stat_make.......................................... 16 Index 17 cmicrorna Construct cmicrorna object Constructs an S3 object called cmicrorna contains data returned by calling get_mir. Used to define methods for printing and visualizing microrna-gene expression correlations. cmicrorna(dat_mir) dat_mir A data.frame such as this returned by calling get_mir. An S3 object of class cmicrorna dat <- get_mir(conn,

cor_hist 3 mir = 'hsa-let-7g', = 'STES', min_abs_cor =.3, max_num = 5) # make a cmicrorna object cmir <- cmicrorna(dat) cor_hist A histogram of the correlations of microrna or tf sets Plot a hist of sets of micrornas or transcription factors-gene correlations in a TCGA. cor_hist(ob,,...) ob... Other options A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To An hist plot of the correlations values between genes a microrna or a transcription factor in a TCGA dat <- get_mir(conn, mir = c('hsa-let-7g', 'hsa-let-7i'), = 'STES') # make a cmicrorna object cmir <- cmicrorna(dat) # print object

4 cor_igraph cor_hist(cmir) cor_igraph Make an igraph object An igraph object of from cmicrorna or ctf objects. cor_igraph(ob) ob A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. An igraph object # load required libraries library(rsqlite) library(cregulome) conn <- dbconnect(sqlite(), fl) dat <- get_mir(conn, mir = c('hsa-let-7g', 'hsa-let-7i'), = 'STES') # make a cmicrorna object cmir <- cmicrorna(dat) # print object cor_igraph(cmir)

cor_joy 5 cor_joy A joy plot of correlation of microrna or tf sets A ggridges joy plot of sets of micrornas or transcription factors-gene correlations in a TCGA. cor_joy(ob,,...) ob... Other options A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To An ggridges plot object dat <- get_mir(conn, mir = c('hsa-let-7g', 'hsa-let-7i'), = 'STES') # make a cmicrorna object cmir <- cmicrorna(dat) # print object cor_joy(cmir)

6 cor_plot cor_plot Plot method for cmicrorna and ctf objects A dot plot of microrna/tf correlation in a single of TCGA. When the object cmicrorna/ctf contains more than one TCGA studies, the argument is a requirement. cor_plot(ob,,...) ob... Other options A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A ggplot object of a dot plot of the correlation values between genes and micrornas or transcription factors in a TCGA. dat <- get_mir(conn, mir = 'hsa-let-7g', = 'STES', min_abs_cor =.3, max_num = 5) # make a cmicrorna object cmir <- cmicrorna(dat) # print object cor_plot(cmir)

cor_prep 7 cor_prep Prepare correlation data for plotting Not meant to be called direclty by the user. cor_prep(ob,, add_dir = TRUE, add_corr = TRUE) ob add_dir add_corr A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A logical default TRUE for whether to add a column called Direction that has the direction of the correlation; positive or negative. A logical default TRUE for whether to add a colum called Correlation that has the absolute value of the correlation A data.frame cor_tidy Tidy cmicrorna and ctf objects Tidy cmicrorna and ctf objects cor_tidy(ob) ob A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A tidy data.frame of four columns. mirna_base or tfis the microrna mirbase IDs, feature is the features/genes, cor is the corresponding expression correlations and is TCGA ID.

8 cor_upset dat <- get_mir(conn, mir = 'hsa-let-7g', = 'STES', min_abs_cor =.3, max_num = 5) # make a cmicrorna object cmir <- cmicrorna(dat) # convert cmicrorna object to a tidy data.frame tidy_cmir <- cor_tidy(cmir) cor_upset upset plot of microrna or tf sets upset of sets of micrornas or transcription factors and their correlated features in a TCGA. cor_upset(ob,,...) ob... Other options A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To An upset plot

cor_venn_diagram 9 dat <- get_mir(conn, mir = c('hsa-let-7g', 'hsa-let-7i'), = 'STES') # make a cmicrorna object cmir <- cmicrorna(dat) # print object cor_upset(cmir) cor_venn_diagram Venn Diagram of microrna or transcription factor correlated features Count and plot the numbers of microrna correlated features in cmicrorna object. cor_venn_diagram(ob,,...) ob... Other options A cmicrorna or ctf object such as this returned by calling cmicrorna or ctf. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A venn diagram with a circle or an ellipses for each microrna and the number of correlated features.

10 cregulome dat <- get_mir(conn, mir = c('hsa-let-7g', 'hsa-let-7i'), = 'STES') # make a cmicrorna object cmir <- cmicrorna(dat) # make graph cor_venn_diagram(cmir) cregulome cregulome package Download, access and visualize Regulome (microrna and transcription factors) data from mir- Cancer and Cistrome cancer cregulome functions to download and query the database file get_db get_tf get_mir cregulome functions to create S3 objects ctf cmicrorna cregulome functions to reshape S3 objects cor_tidy cor_igraph cregulome functions to visualize data in S3 objects cor_hist cor_joy cor_plot cor_upset cor_venn_diagram

ctf 11 ctf Construct ctf object Constructs an S3 object called ctf contains data returned by calling get_tf. Used to define methods for printing and visualizing transcription factors-gene expression correlations. ctf(dat_tf) dat_tf A data.frame such as this returned by calling get_tf. An S3 object of class ctf dat <- get_tf(conn, tf = 'LEF1', = '"STES*"', min_abs_cor =.3, max_num = 5) # make a ctf object ctf <- ctf(dat) get_db Get cregulome.db file This function calls download.file to download the pre-build database file of cregulome. Additionally, the function checks the validity of the pre-defined URL and whether the database file exists in the current working directory to avoid redownloading it. Typically, users would run this function once at the first time the use the package or to update the database to the latest version.

12 get_mir get_db(test = FALSE, destfile,...) test destfile A logical, default FALSE. When TRUE downloads a database file with the same structure with a subset of the data for speed. A character vector for the desired path for the database file. By default, when not specified, is constructed by using tempdir as a directory and the string cregulome.db.gz... Optional arguments passed to download.file Downloads a compressed sqlite file to the current working directory. The file is named cregulome.db.gz by default and it s not advised to change the name to avoid breaking the other functions that calls the database. ## Not run: # download a test set of the database get_db(test = TRUE) # download the full database file get_db(test = FALSE) ## End(Not run) # load the test db file from shipped with the pacakge db_file <- system.file("extdata", "cregulome.db", package = "cregulome") file.info(db_file) get_mir Get microrna correlations from cregulome.db This function access the sqlite database file which is obtained by running get_db. Basically, the function provides ways to query the database to the correlation data of the micrornas of interest. The function returns an error if the database file cregulome.db is not in the working directory. get_mir(conn, mir,, min_abs_cor, max_num, targets_only = FALSE)

get_mir 13 conn mir min_abs_cor max_num targets_only A connection such as this returned by dbconnect A required character vector of the micrornas of interest. These are the mir- Base ID which are the official identifiers of the widely used mirbase database, http://www.mirbase.org/. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A numeric, an absolute correlation minimum between 0 and 1 for each mir. An integer, maximum number of features to show for each mir in each. A logical, default FALSE. When TRUE, features will be the microrna targets as defined in the package targetscan.hs.eg.db. A tidy data.frame of four columns. mirna_base is the microrna mirbase IDs, feature is the features/genes, cor is the corresponding expression correlations and is TCGA ID. # get microrna correlations in all studies get_mir(conn, mir = 'hsa-let-7g') # get correlations in a particular get_mir(conn, mir = 'hsa-let-7g', = 'STES') get_mir(conn, mir = 'hsa-let-7g', = 'STES', min_abs_cor =.3, max_num = 5)

14 get_tf get_tf Get transcription factor correlations from cregulome.db This function access the sqlite database file which is obtained by running get_db. Basically, the function provides ways to query the database to the correlation data of the transcription factors of interest. The function returns an error if the database file cregulome.db is not in the working directory. get_tf(conn, tf,, min_abs_cor, max_num, targets_only = FALSE) conn tf min_abs_cor max_num targets_only A connection such as this returned by dbconnect A required character vector of the transcription factor of interest. These are the HUGO official gene symbols of the genes contains the transcription factor. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A numeric, an absolute correlation minimum between 0 and 1 for each mir. An integer, maximum number of features to show for each mir in each. A logical, default FALSE. When TRUE, features will be the microrna targets as defined in the package targetscan.hs.eg.db. A tidy data.frame of four columns. tf is the official gene symbols of the genes contains the transcription factor, feature is the features/genes, cor is the corresponding expression correlations and is TCGA ID. ## Not run: # get transcription factors correlations in all studies get_tf(conn, tf = 'LEF1') ## End(Not run)

stat_collect 15 # get correlations in a particular get_tf(conn, tf = 'LEF1', = '"STES*"') get_tf(conn, tf = 'LEF1', = '"STES*"', min_abs_cor =.3, max_num = 5) stat_collect Collect data from SQLite database Not meant to be called direclty by the user. stat_collect(conn,, stat, type = "mir") conn stat type A connection such as this returned by dbconnect A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A string such as this returned by stat_make A character string. Either mir of tf. Used to define columns and tables names. A data.frame

16 stat_make stat_make Make A SQL statment Not meant to be called direclty by the user. stat_make(reg,, min_abs_cor, max_num, targets_only = FALSE, type = "mir") reg min_abs_cor max_num targets_only type A character vector of one or more regulator ID. A character vector of The Cancer Genome Atlas (TCGA) identifiers. To A numeric, an absolute correlation minimum between 0 and 1 for each mir. An integer, maximum number of features to show for each mir in each. A logical, default FALSE. When TRUE, features will be the microrna targets as defined in the package targetscan.hs.eg.db. A character string. Either mir of tf. Used to define columns and tables names. A character string stat_make(reg = 'hsa-let-7g', = 'STES') stat_make(reg = 'hsa-let-7g', = 'STES', min_abs_cor =.3) stat_make(reg = 'hsa-let-7g', = 'STES', min_abs_cor =.3, max_num = 5)

Index cmicrorna, 2, 3 10 cor_hist, 3, 10 cor_igraph, 4, 10 cor_joy, 5, 10 cor_plot, 6, 10 cor_prep, 7 cor_tidy, 7, 10 cor_upset, 8, 10 cor_venn_diagram, 9, 10 cregulome, 10 cregulome-package (cregulome), 10 ctf, 3 10, 11 dbconnect, 13 15 download.file, 11, 12 get_db, 10, 11, 12, 14 get_mir, 2, 10, 12 get_tf, 10, 11, 14 ggridges, 5 hist, 3 stat_collect, 15 stat_make, 15, 16 tempdir, 12 upset, 8 17