Package ecolitk. November 2, 2018

Similar documents
Package splicegear. R topics documented: December 4, Title splicegear Version Author Laurent Gautier

Package makecdfenv. March 13, 2019

Package affypdnn. January 10, 2019

Package RLMM. March 7, 2019

Package OmicCircos. R topics documented: November 17, Version Date

Package areaplot. October 18, 2017

Package OmicCircos. R topics documented: November 1, Version Date Title High-quality circular visualization of omic data

Package les. October 4, 2013

Package macorrplot. R topics documented: October 2, Title Visualize artificial correlation in microarray data. Version 1.50.

Package geecc. R topics documented: December 7, Type Package

Package quantsmooth. R topics documented: May 4, Type Package

Package NetWeaver. January 11, 2017

Package STROMA4. October 22, 2018

Package BAC. R topics documented: November 30, 2018

xmapbridge: Graphically Displaying Numeric Data in the X:Map Genome Browser

Package JBTools. R topics documented: June 2, 2015

Package multigraph. R topics documented: November 1, 2017

Statistical Programming with R

Package Grid2Polygons

Package compartmap. November 10, Type Package

Package generxcluster

Package multigraph. R topics documented: January 24, 2017

Package SMAP. R topics documented: June 19, 2018

Package replicatedpp2w

Package Daim. February 15, 2013

Package DPBBM. September 29, 2016

Package clstutils. R topics documented: December 30, 2018

The som Package. September 19, Description Self-Organizing Map (with application in gene clustering)

Az R adatelemzési nyelv

Package SCAN.UPC. July 17, 2018

Package RCEIM. April 3, 2017

Package GCAI.bias. February 19, 2015

Package MIRA. October 16, 2018

Package macat. October 16, 2018

Package RTNduals. R topics documented: March 7, Type Package

Package geecc. October 9, 2015

Package BayesPeak. R topics documented: September 21, Version Date Title Bayesian Analysis of ChIP-seq Data

Package munfold. R topics documented: February 8, Type Package. Title Metric Unfolding. Version Date Author Martin Elff

Package panelview. April 24, 2018

Practical Course in Genome Bioinformatics

Preliminary Figures for Renormalizing Illumina SNP Cell Line Data

Package DMCHMM. January 19, 2019

R Graphics. Paul Murrell. The University of Auckland. R Graphics p.1/47

Package muscle. R topics documented: March 7, Type Package

Package ffpe. October 1, 2018

Package SNPchip. May 3, 2018

Package BubbleTree. December 28, 2018

CQN (Conditional Quantile Normalization)

Practical 2: Plotting

Package ChIPseeker. July 12, 2018

Package FunciSNP. November 16, 2018

Rsubread package: high-performance read alignment, quantification and mutation discovery

Package rain. March 4, 2018

Package MethylSeekR. January 26, 2019

Graphics - Part III: Basic Graphics Continued

Package PSEA. R topics documented: November 17, Version Date Title Population-Specific Expression Analysis.

Package POST. R topics documented: November 8, Type Package

Rsubread package: high-performance read alignment, quantification and mutation discovery

Package beanplot. R topics documented: February 19, Type Package

Package mixphm. July 23, 2015

Package EnQuireR. R topics documented: February 19, Type Package Title A package dedicated to questionnaires Version 0.

Package dyncorr. R topics documented: December 10, Title Dynamic Correlation Package Version Date

Package visualizationtools

Package gggenes. R topics documented: November 7, Title Draw Gene Arrow Maps in 'ggplot2' Version 0.3.2

Package ggrepel. September 30, 2017

Package qrfactor. February 20, 2015

The nor1mix Package. August 3, 2006

Package cghmcr. March 6, 2019

Package rgreat. June 19, 2018

Package RConics. R topics documented: February 19, Type Package Title Computations on Conics Version 1.0 Date Author Emanuel Huber

Package dualks. April 3, 2019

Bioconductor tutorial

Package pvclust. October 23, 2015

Package CorporaCoCo. R topics documented: November 23, 2017

Introduction Basics Simple Statistics Graphics. Using R for Data Analysis and Graphics. 4. Graphics

Package mgsa. January 13, 2019

Package SOFIA. January 22, 2017

Quality control of array genotyping data with argyle Andrew P Morgan

Exploring cdna Data. Achim Tresch, Andreas Buness, Tim Beißbarth, Wolfgang Huber

Polar Coordinates. 2, π and ( )

Package snplist. December 11, 2017

The nor1mix Package. June 12, 2007

Package HiveR. February 15, 2013

The supclust Package

Package sciplot. February 15, 2013

Package fwdmsa. R topics documented: July 8, Version 0.1. Date Title Forward search for Mokken scale analysis

Ansoft HFSS Windows Screen Windows. Topics: Side Window. Go Back. Contents. Index

Package mmeta. R topics documented: March 28, 2017

Package frma. R topics documented: March 8, Version Date Title Frozen RMA and Barcode

Package ibbig. R topics documented: December 24, 2018

Exploring cdna Data. Achim Tresch, Andreas Buness, Tim Beißbarth, Wolfgang Huber

Chapter 5 An Introduction to Basic Plotting Tools

Package fitplc. March 2, 2017

Package GLDreg. February 28, 2017

Package RobLoxBioC. August 3, 2018

Package gridextra. September 9, 2017

Package pandar. April 30, 2018

Package MSGLasso. November 9, 2016

Package tclust. May 24, 2018

Matrix algebra. Basics

Transcription:

Version 1.55.0 Date 2010-04 Title Meta-data and tools for E. coli Author Laurent Gautier Package ecolitk November 2, 2018 Maintainer Laurent Gautier <lgautier@gmail.com> biocviews Annotation, Visualization Depends R (>= 2.10) Imports Biobase, graphics, methods Suggests ecolileucine, ecolicdf, graph, multtest, affy Meta-data and tools to work with E. coli. The tools are mostly plotting functions to work with circular genomes. They can used with other genomes/plasmids. License GPL (>= 2) git_url https://git.bioconductor.org/packages/ecolitk git_branch master git_last_commit 68a90b5 git_last_commit_date 2018-10-30 Date/Publication 2018-11-01 R topics documented: ecoli.m52.genome...................................... 2 ecoligenome.operon..................................... 3 ecoligenomebnum..................................... 4 ecoligenomebnum2multifun............................. 4 gccontent.......................................... 5 linkedmultiget........................................ 5 multifun........................................... 6 pointscircle......................................... 7 polar2xy........................................... 8 polygonchrom....................................... 9 wstringapply........................................ 11 Index 13 1

2 ecoli.m52.genome ecoli.m52.genome Escherichia coli data Meta-data related to Escherichia coli data(ecoli.m52.genome) data(ecoligenomechrloc) data(ecoligenomesymbol2affy) data(ecoligenomesymbol) data(ecoligenomestrand) data(ecoligenome.operon) ecoli.len Format The format for ecoli.m52.genome is character with genome sequence. The format for ecoligenomechrloc is an environment (as a hash table). Each key is an Affyemtrix probe set ID, and each value is vector of two integers (begining and end - see the details below) The format for ecoligenomesymbol2afffy is an environment (as a hash table). Each key is a gene symbol name. The format for ecoligenomesymbol is an environment (as a hash table). Each key is an Affymetrix probe set id ecoli.len is a variable containing the size of the genome in ecoli.m52.genome. The environments ecoligenomesymbol2afffy and ecoligenomesymbol are like the ones in the data packages built by annbuilder. The environment ecoligenomechrloc differs: two integers are associated with each key, one corresponds to the begining of the segment the other to the end. The environment ecoligenomestrand returns a logical. TRUE means that the orientation is +, FALSE means that the orientation is - (and NA is used when irrelevant for the key). Source http://www.genome.wisc.edu/sequencing/k12.htm and http://www.biostat.harvard.edu/ complab/dchip/info_file.htm data(ecoli.m52.genome)

ecoligenome.operon 3 ecoligenome.operon Known operon in E.coli - data.frame The known operon in the Escherichia coli genome. data(ecoligenome.operon) Format A data frame with 932 observations (genes) on the following 4 variables. gene.name a character vector gene.annotation a character vector operon.name a factor with levels the names of the operons operon.comments a factor with levels the comments for the operons For some operons, the source of information specifies the existence of regulating elements such as promoter, terminator, box, etc... In those cases, the gene.name is set to "Regulation", and the gene.annotation gives what kind of regulating element it is. If volonteers, it would be neat to map those on the genome... Besides that, not much to add. The data structure is fairly straightforward. Source Built from the webpage: http://www.cib.nig.ac.jp/dda/backup/taitoh/ecoli.operon.html library(biobase) data(ecoligenome.operon) data(ecoligenomesymbol2affy) ## something that might be useful when working with Affymetrix data: ## get the Affymetrix identifiers for the probe sets bundled in operons ## (see the vignette for more details) ecoligenome.operon$affyid <- unname(unlist(mget(ecoligenome.operon$gene.name, ecoligenomesymbol2affy, ifnotfound=na)))

4 ecoligenomebnum2multifun ecoligenomebnum Environment for bnum identifiers Environments to associate Affymetrix probe set IDs with bnum IDs data(ecoligenomebnum) data(ecoligenomebnum2symbol) data(ecoligenomebnum2enzyme) data(ecoligenomebnum2genbank) data(ecoligenomebnum2geneproduct) data(ecoligenomesymbol2bnum) Format These are environment objects. Escherichia coli genes are sometimes identified by bnum s. This identfier is typically a b followed by digits. Source BNUM numbers were parsed out of the Affymetrix identifiers. BNUM2* were obtained from the GenProtEC website. ecoligenomebnum2multifun Environment An environment to store associtations between bnum identifiers (key) and MultiFun identifiers (or strand information). data(ecoligenomebnum2multifun) Format The format is: length 0 <environment> - attr(*, "comments")= chr "GenProtEC: MultiFun assignments for E. coli modules September 17th, 2003"

gccontent 5 Source MultiFun is a classification scheme. The structure is approximately tree-like. Several MultiFun numbers can be assigned to one bnum. "http://genprotec.mbl.edu/files/multifun.txt" gccontent function to compute gccontent A simple R function to compute the GC content of a sequence gccontent(x) Arguments x a vector of mode character Value This a simple (and not particularly fast) function to compute the GC content of sequence. When speed is an issue, one should use the function in the package matchprobes. This function only exists to avoid dependency on this package. The GC content (numeric) linkedmultiget A function to look for values across linked environments A function to look for values across linked environments. linkedmultiget(x, envir.list = list(), unique = TRUE) Arguments x envir.list unique The keys in the first environment in the list. A list of environments. Simplify the list returned by ensuring that the values for each key are unique.

6 multifun Environments can be considered as hashtables. The keys are obviously strings, but in some cases the associated values are also strings. This is the case for annotation environments (as built with the package AnnBuilder). This function helps to look for values across several environments: the keys have associated values in a first environment, these values are used as keys in the second environments, etc... Value A list of length the length of x. Author(s) Laurent Gautier See Also mget data(ecoligenomebnum) data(ecoligenomebnum2multifun) data(multifun) ## get 5 Affymetrix IDs set.seed(456) my.affyids <- sample(ls(ecoligenomebnum), 5) ## get the MULTIFUN annotations for them r <- linkedmultiget(my.affyids, list(ecoligenomebnum, ecoligenomebnum2multifun, multifun)) print(r) multifun multifun classification The MultiFun classification scheme data(multifun) data(ecoligenomemultifun2go) Format These are environments.

pointscircle 7 Source http://genprotec.mbl.edu/files/multifun.txt ## To be done... pointscircle Functions to plot circular related figures Functions to plot circular related figures linescircle(radius, center.x = 0, center.y = 0, edges = 300,...) polygondisk(radius, center.x = 0, center.y = 0, edges=300,...) arrowsarc(theta0, theta1, radius, center.x = 0, center.y = 0, edges = 10, length = 0.25, angle = 30, code = 2,...) pointsarc(theta0, theta1, radius, center.x = 0, center.y = 0,...) linesarc(theta0, theta1, radius, center.x = 0, center.y = 0,...) polygonarc(theta0, theta1, radius.in, radius.out, center.x = 0, center.y = 0, edges = 10, col = "black", border = NA,...) Arguments theta0, theta1 radius radius.in start and end angles for the arc radius of the circle inner radius radius.out outer radius center.x, center.y Coordinates for the center of the circle (default to (0, 0)) edges col number of edges the shape is made of color border border (see polygon) length, angle, code see the corresponding parameters for the function arrows... optional graphical paramaters to come... for now the best to run the examples and experiment by yourself...

8 polar2xy Value Function only used for their border effects. Author(s) laurent par(mfrow=c(2,2)) n <- 10 thetas <- rev(seq(0, 2 * pi, length=n)) rhos <- rev(seq(1, n) / n) xy <- polar2xy(rhos, thetas) colo <- heat.colors(n) plot(0, 0, xlim=c(-2, 2), ylim=c(-2, 2), type="n") for (i in 1:n) linescircle(rhos[i]/2, xy$x[i], xy$y[i]) plot(0, 0, xlim=c(-2, 2), ylim=c(-2, 2), type="n") for (i in 1:n) polygondisk(rhos[i]/2, xy$x[i], xy$y[i], col=colo[i]) plot(0, 0, xlim=c(-2, 2), ylim=c(-2, 2), type="n", xlab="", ylab="") for (i in 1:n) polygonarc(0, thetas[i], rhos[i]/2, rhos[i], center.x = xy$x[i], center.y = xy$y[i], col=colo[i]) plot(0, 0, xlim=c(-2, 2), ylim=c(-2, 2), type="n", xlab="", ylab="") for (i in (1:n)[-1]) { linescircle(rhos[i-1], col="gray", lty=2) polygonarc(thetas[i-1], thetas[i], rhos[i-1], rhos[i], col=colo[i], edges=20) arrowsarc(thetas[i-1], thetas[i], rhos[i] + 1, col=colo[i], edges=20) } polar2xy Functions to perform polar coordinate related functions Functions to perform polar coordinate related functions

polygonchrom 9 polar2xy(rho, theta) xy2polar(x, y) rotate(x, y, alpha) Arguments x y rho theta alpha cartesian coordinate cartesian coordinate polar radius rho polar angle theta angle to perform rotation y and theta can be respectively missing. In this case, x and rho are expected to be lists with entries x,y, rho,theta respectively. n <- 40 nn <- 2 thetas <- seq(0, nn * 2 * pi, length=n) rhos <- seq(1, n) / n plot(c(-1, 1), c(-1, 1), type="n") abline(h=0, col="grey") abline(v=0, col="grey") xy <- polar2xy(rhos, thetas) points(xy$x, xy$y, col=rainbow(n)) polygonchrom Functions to plot circular chromosomes informations Functions to plot circular chromosomes informations cplotcircle(radius=1, xlim=c(-2, 2), ylim=xlim, edges=300, main=null, main.inside,...) chrompos2angle(pos, len.chrom, rot=pi/2, clockwise=true)

10 polygonchrom polygonchrom(begin, end, len.chrom, radius.in, radius.out, total.edges = 300, edges = max(round(abs(end - begin)/len.chrom * total.edges), 2, na.rm = TRUE), rot = pi/2, clockwise = TRUE,...) lineschrom(begin, end, len.chrom, radius, total.edges = 300, edges = max(round(abs(end - begin)/len.chrom * total.edges), 2, na.rm = TRUE), rot = pi/2, clockwise = TRUE,...) ecoli.len Arguments radius xlim, ylim pos begin end len.chrom radius.in radius.out total.edges edges rot radius range for the plot. Can be used to zoom-in a particular region. position (nucleic base coordinate) begining of the segment (nucleic base number). end of the segment (nucleic base number). length of the chromosome in base pairs inner radius outer radius total number of edges for the chromosome number of edges for the specific segment(s) rotation (default is pi / 2, bringing the angle zero at 12 o clock) clockwise rotate clockwise. Default to TRUE. main, main.inside main titles for the plot... optional graphical parameters The function chrompos2angle is a convenience function. The variable ecoli.len contains the size of the Escheria coli genome considered (K12). Value Except chrompos2angle, the function are solely used for their border effects. Author(s) laurent <laurent@cbs.dtu.dk>

wstringapply 11 data(ecoligenomesymbol2affy) data(ecoligenomechrloc) ## find the operon lactose ("lac*" genes) lac.i <- grep("^lac", ls(ecoligenomesymbol2affy)) lac.symbol <- ls(ecoligenomesymbol2affy)[lac.i] lac.affy <- unlist(lapply(lac.symbol, get, envir=ecoligenomesymbol2affy)) beg.end <- lapply(lac.affy, get, envir=ecoligenomechrloc) beg.end <- matrix(unlist(beg.end), nc=2, byrow=true) lac.o <- order(beg.end[, 1]) lac.i <- lac.i[lac.o] lac.symbol <- lac.symbol[lac.o] lac.affy <- lac.affy[lac.o] beg.end <- beg.end[lac.o, ] lac.col <- rainbow(length(lac.affy)) par(mfrow=c(2,2)) ## plot cplotcircle(main="lac genes") polygonchrom(beg.end[, 1], beg.end[, 2], ecoli.len, 1, 1.2, col=lac.col) rect(0, 0, 1.1, 1.1, border="red") cplotcircle(xlim=c(0, 1.2), ylim=c(0, 1.1)) polygonchrom(beg.end[, 1], beg.end[, 2], ecoli.len, 1, 1.1, col=lac.col) rect(0.4, 0.8, 0.7, 1.1, border="red") cplotcircle(xlim=c(.45,.5), ylim=c(.85, 1.0)) polygonchrom(beg.end[, 1], beg.end[, 2], ecoli.len, 1, 1.03, col=lac.col) mid.genes <- apply(beg.end, 1, mean) mid.angles <- chrompos2angle(mid.genes, ecoli.len) xy <- polar2xy(1.03, mid.angles) xy.labels <- data.frame(x = seq(0.45, 0.5, length=4), y = seq(0.95, 1.0, length=4)) segments(xy$x, xy$y, xy.labels$x, xy.labels$y, col=lac.col) text(xy.labels$x, xy.labels$y, lac.symbol, col=lac.col) wstringapply Apply a function on a window sliding on a string Apply a function on a window sliding on a string. wstringapply(x, SIZE, SLIDE, FUN,...)

12 wstringapply Arguments x The string SIZE The size of the window (number of characters). SLIDE offset to move at each slide FUN The function to be applied... optional parameter for the function FUN Apply the function FUN to substrings of x of length SIZE. Value A list of size nchar(x) -SIZE. Author(s) L, Gautier

Index Topic aplot pointscircle, 7 Topic datasets ecoli.m52.genome, 2 ecoligenome.operon, 3 ecoligenomebnum, 4 ecoligenomebnum2multifun, 4 multifun, 6 Topic dplot polygonchrom, 9 Topic hplot polygonchrom, 9 Topic manip gccontent, 5 linkedmultiget, 5 polar2xy, 8 wstringapply, 11 arrows, 7 arrowsarc (pointscircle), 7 chrompos2angle (polygonchrom), 9 cplotcircle (polygonchrom), 9 ecoli.len (ecoli.m52.genome), 2 ecoli.m52.genome, 2 ecoli.operon (ecoli.m52.genome), 2 ecoligenome.operon, 3 ecoligenomebnum, 4 ecoligenomebnum2enzyme (ecoligenomebnum), 4 ecoligenomebnum2genbank (ecoligenomebnum), 4 ecoligenomebnum2geneproduct (ecoligenomebnum), 4 ecoligenomebnum2multifun, 4 ecoligenomebnum2strand (ecoligenomebnum2multifun), 4 ecoligenomebnum2symbol (ecoligenomebnum), 4 ecoligenomechrloc (ecoli.m52.genome), 2 ecoligenomemultifun2go (multifun), 6 ecoligenomestrand (ecoli.m52.genome), 2 ecoligenomesymbol (ecoli.m52.genome), 2 ecoligenomesymbol2affy (ecoli.m52.genome), 2 ecoligenomesymbol2bnum (ecoligenomebnum), 4 gccontent, 5 linesarc (pointscircle), 7 lineschrom (polygonchrom), 9 linescircle (pointscircle), 7 linkedmultiget, 5 mget, 6 multifun, 6 pointsarc (pointscircle), 7 pointscircle, 7 polar2xy, 8 polygon, 7 polygonarc (pointscircle), 7 polygonchrom, 9 polygondisk (pointscircle), 7 rotate (polar2xy), 8 wstringapply, 11 xy2polar (polar2xy), 8 13