Network analysis. Martina Kutmon Department of Bioinformatics Maastricht University

Size: px
Start display at page:

Download "Network analysis. Martina Kutmon Department of Bioinformatics Maastricht University"

Transcription

1 Network analysis Martina Kutmon Department of Bioinformatics Maastricht University

2 What's gonna happen today? Network Analysis Introduction Quiz Hands-on session ConsensusPathDB interaction database

3

4 Outline 1. Terminology 2. Examples of networks 3. Network properties 4. Use cases in biology 5. Interaction data 6. Introduction practical session - Cytoscape

5 1. Terminology

6 Terminology Nodes and Edges nodes are the objects in the network edges are links and interactions in the network Node Edge Node

7 Terminology Graph vs. Network graph = generic term for a mathematical concept of a set of nodes connected by links called edges network = graph = representation of a set of objects where some pairs of objects are connected by links V = {1, 2, 3, 4, 5, 6} E = {{1, 2}, {1, 5}, {2, 3}, {2, 5}, {3, 4}, {4, 5}, {4, 6}}.

8 Terminology Neighbor a neighbor in a network is a node that is linked through a direct edge A Edge A is a neighbor of B B is a neighbor of A and C C is a neighbor of B B Edge C

9 Terminology Path a path is a sequence of edges which connect a sequence of nodes A Path from C to E: C --> B --> D --> E length = 3 B C D E

10 2. Examples of networks

11 Networks everywhere

12 Networks everywhere Seven Bridges of Königsberg Leonhard Euler, 1736 Euler proved that this problem can't be solved and laid the foundations of graph theory.

13 Networks everywhere

14 Networks everywhere

15 Networks everywhere

16 Networks everywhere

17 Networks everywhere

18 Networks everywhere

19 Biological networks Metabolic networks Gene networks Protein networks Nodes Metabolites Nodes TFs, target genes Nodes Proteins Edges Enzymatic conversions Edges Transcriptional interaction Edges Physical / functional interaction

20 3. Network, Node and Edge Properties

21 Network properties Directed vs. undirected

22 Network properties Cyclic vs. acyclic

23 Network properties Weighted vs. unweighted

24 Node Properties How influential is a person in a social network? How important is a room in a building? How important is a transcription factor in a biological process? How much influence has a mutation in a protein?

25 Node Properties Degree centrality: number of edges incident upon a node Undirected: only node degree! Directed: in vs. out degree

26 Node Properties Degree centrality: number of edges incident upon a node

27 Node Properties Degree centrality Biological interpretation: nodes with a high degree are often essential elements in the network transcription factors have a high out-degree studies showed that proteins with a high degree are more likely to be essential for survival

28 Node Properties Closeness centrality: the inverse of the average of the closest paths to all other nodes C(n) = 1 / avg( L(n,m) )) F -> A = 2 F -> B = 3 F -> D = 2 F -> C = 1 F -> E = 2 (F - C - A) (F - C - D - B) (F - C - D) (F - C) (F - C - E) avg(l(n,m)) = ( ) /5 = 2 C(n) = 1 / 2 = 0.5

29 Node Properties Closeness centrality: the inverse of the average of the closest paths to all other nodes C(n) = 1 / avg( L(n,m) )) C -> A = 1 C -> B = 2 C -> D = 1 C -> F = 1 C -> E = 1 avg(l(n,m)) = ( ) /5 = 1.2 C(n) = 1 / 2 = 0.833

30 Node Properties Closeness centrality Biological interpretation: indication for how fast information spreads from a given node to other reachable nodes in the network the more central a node is, the lower is the distance to all other nodes, the higher is the closeness

31 Node Properties Betweenness centrality: Cb(n) = s n t (σst (n) / σst) - calculates the node betweenness for node n - σst = number of shortest path from s to t - σst (n) = number of shortest path from s to t that go through n - repeat this for each node pair in the network

32 Node Properties Betweenness centrality Biological interpretation: amount of control that this node exerts over the interactions of other nodes in the network how much information load is on the node connectivity of the network hubs that connect to subnetworks Applies to edges as well!

33 4. Use Cases in Biology

34 Identification of hubs Hub elements tend to be essential Often involved in multiple processes Important for the overall connectivity of the network Brodsky, Igor E., and Ruslan Medzhitov. "Targeting of immune signalling networks by bacterial pathogens." Nature cell biology 11.5 (2009):

35 Identification of hubs Brodsky, Igor E., and Ruslan Medzhitov. "Targeting of immune signalling networks by bacterial pathogens." Nature cell biology 11.5 (2009):

36 Clustering Grouping a set of objects in such a way that objects within a cluster are more similar (in some sense of another) to each other than to those in other clusters. Find objects that behave similar, have similar properties, are close to each other,...

37 Clustering Tan, Kai, et al. "Transcriptional regulation of protein complexes within and across species." Proceedings of the National Academy of Sciences (2007):

38 Network Motifs Network motifs -> significantly recurring connected subnetwork Types of networks show different motifs Wang, Edwin, and Enrico Purisima. "Network motifs are enriched with transcription factors whose transcripts have short halflives." Trends in Genetics21.9 (2005):

39 Active Subnetworks Search for expression activated subnetworks Connected parts in the network that show significant changes in expression over particular subsets of conditions

40 Active Subnetworks

41 Active Subnetworks Breast cancer data Basal subtype of the cancer Identification of positive feed forward loops Gaire, Raj K., et al. "Discovery and analysis of consistent active sub-networks in cancers." BMC bioinformatics 14.Suppl 2 (2013): S7.

42 Data Integration Big challenge in network biology: data integration increased complexity proteins, metabolites, genes SNP data, transcripts, micrornas, cells,...

43 Network Inference A lot of different methods Most commonly used = correlation networks Predicting the "wiring diagram" of the network Calculate the correlation between two genes based on their expression Use correlation between genes as edges 20,000 genes -> ~8GB data -> ~3 hrs calculation time

44 Network Inference Correlation does not tell you the direction of the edge! Data derived networks Combine with other interaction data like IntAct or STRING

45 5. Interaction Data

46 PathGuide pathway data interaction data IntAct, STRING, MINT, Pazar, mirecords, microcosm, DrugBank, BIND,...

47 IntAct

48 STRING

49 mirecords

50 5. Introduction Quiz and Hands-on Session

51 Quiz Work alone or in groups Get familiar with the tools

52 Cytoscape Network visualization and analysis tool > 100 plugins (now called apps) available for additional functionality Not only biological networks, but also social sciences, semantic web, complex network analysis

53 Cytoscape Cytoscape with plugins (or now called apps) Haven't installed Cytoscape yet? Ask for a USB stick.

54 Cytoscape

55 Cytoscape plugins / apps

56 Open Network Always: File -> Import!

57 NetworkMerge Plugins -> Advanced Network Merge

58 NetworkAnalyzer Calculates all the node properties in the network

59 NetworkAnalyzer

60 Layouts Grid Layout Circular Layout

61 Layouts Hierarchical Layout Forced-Directed Layout

62 VizMapper - change the visual style color your nodes (e.g. based on expression)

63 VizMapper - change the visual style add icons on your nodes

64 VizMapper - change the visual style change the background color

65 VizMapper - change the visual style change node size and colors base on node properties

66 VizMapper - change the visual style pie charts on nodes

67 VizMapper - change the visual style highlight a subnetwork

68 Hands on session 1. Analyze a protein interaction network from STRING 2. Load a pathway from WikiPathways and extend it with regulatory interactions (micrornas, TFs and drugs) -> CyTargetLinker plugin 3. Find interactions between a group of genes visualize expression data on the network (MiMI plugin)

69 CyTargetLinker Integrate regulatory interactions into network analysis Generic and flexible - different regulatory interactions can be integrated together TF-gene interactions from ENCODE, microrna-target from mirtarbase and mirecords and drug-targets from DrugBank

70 CyTargetLinker Results panel shows how many interactions have been added Colors of the edges indicate from which resource the interaction comes from (data is extracted from so called RINs - regulatory interaction networks)

71 MiMI Michigan Molecular Interactions Find interactions between query genes (and first neighbors) Only supports query by gene name (since many genes have synonyms, genes might not return any result)

72 Hands on session After quiz and coffee Instructions available on projects.bigcat.unimaas.nl/ebi-roadshow/

73 Thanks to Anwesha Egon

74 Questions?

Biological Networks Analysis

Biological Networks Analysis Biological Networks Analysis Introduction and Dijkstra s algorithm Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein The clustering problem: partition genes into distinct

More information

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm:

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm: The clustering problem: partition genes into distinct sets with high homogeneity and high separation Hierarchical clustering algorithm: 1. Assign each object to a separate cluster.. Regroup the pair of

More information

Biological Networks Analysis

Biological Networks Analysis iological Networks nalysis Introduction and ijkstra s algorithm Genome 559: Introduction to Statistical and omputational Genomics Elhanan orenstein The clustering problem: partition genes into distinct

More information

A quick review. Which molecular processes/functions are involved in a certain phenotype (e.g., disease, stress response, etc.)

A quick review. Which molecular processes/functions are involved in a certain phenotype (e.g., disease, stress response, etc.) Gene expression profiling A quick review Which molecular processes/functions are involved in a certain phenotype (e.g., disease, stress response, etc.) The Gene Ontology (GO) Project Provides shared vocabulary/annotation

More information

V 1 Introduction! Mon, Oct 15, 2012! Bioinformatics 3 Volkhard Helms!

V 1 Introduction! Mon, Oct 15, 2012! Bioinformatics 3 Volkhard Helms! V 1 Introduction! Mon, Oct 15, 2012! Bioinformatics 3 Volkhard Helms! How Does a Cell Work?! A cell is a crowded environment! => many different proteins,! metabolites, compartments,! On a microscopic level!

More information

An overview of Cytoscape for network biology with a focus on residue interaction networks

An overview of Cytoscape for network biology with a focus on residue interaction networks An overview of Cytoscape for network biology with a focus on residue interaction networks Guillaume Brysbaert IR2 CNRS - Bioinformatics - Unit of Structural and Functional Glycobiology Team: Computational

More information

Nick Hamilton Institute for Molecular Bioscience. Essential Graph Theory for Biologists. Image: Matt Moores, The Visible Cell

Nick Hamilton Institute for Molecular Bioscience. Essential Graph Theory for Biologists. Image: Matt Moores, The Visible Cell Nick Hamilton Institute for Molecular Bioscience Essential Graph Theory for Biologists Image: Matt Moores, The Visible Cell Outline Core definitions Which are the most important bits? What happens when

More information

Cytoscape:un approccio sistematico all'analisi dei dati sperimentali

Cytoscape:un approccio sistematico all'analisi dei dati sperimentali Cytoscape:un approccio sistematico all'analisi dei dati sperimentali Michele Petterlini michele@petterlini.it Università di Verona Centro di BioMedicina Computazionale The network Network: a graph representation

More information

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS. Graph Theory COURSE: Introduction to Biological Networks LECTURE 1: INTRODUCTION TO NETWORKS Arun Krishnan Koenigsberg, Russia Is it possible to walk with a route that crosses each bridge exactly once,

More information

BioNSi - Biological Network Simulation Tool

BioNSi - Biological Network Simulation Tool Workshop: BioNSi - Biological Network Simulation Tool Amir Rubinstein CS @ TAU Tel-Aviv University, Faculty of Life Sciences 8 May 2016 Outline Part I Basic notions: - Modeling and simulation - Crash intro

More information

Properties of Biological Networks

Properties of Biological Networks Properties of Biological Networks presented by: Ola Hamud June 12, 2013 Supervisor: Prof. Ron Pinter Based on: NETWORK BIOLOGY: UNDERSTANDING THE CELL S FUNCTIONAL ORGANIZATION By Albert-László Barabási

More information

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science

More information

The Generalized Topological Overlap Matrix in Biological Network Analysis

The Generalized Topological Overlap Matrix in Biological Network Analysis The Generalized Topological Overlap Matrix in Biological Network Analysis Andy Yip, Steve Horvath Email: shorvath@mednet.ucla.edu Depts Human Genetics and Biostatistics, University of California, Los Angeles

More information

MetScape User Manual

MetScape User Manual MetScape 2.3.2 User Manual A Plugin for Cytoscape National Center for Integrative Biomedical Informatics July 2012 2011 University of Michigan This work is supported by the National Center for Integrative

More information

Advanced Algorithms and Models for Computational Biology -- a machine learning approach

Advanced Algorithms and Models for Computational Biology -- a machine learning approach Advanced Algorithms and Models for Computational Biology -- a machine learning approach Biological Networks & Network Evolution Eric Xing Lecture 22, April 10, 2006 Reading: Molecular Networks Interaction

More information

Structure of biological networks. Presentation by Atanas Kamburov

Structure of biological networks. Presentation by Atanas Kamburov Structure of biological networks Presentation by Atanas Kamburov Seminar Gute Ideen in der theoretischen Biologie / Systembiologie 08.05.2007 Overview Motivation Definitions Large-scale properties of cellular

More information

IPA: networks generation algorithm

IPA: networks generation algorithm IPA: networks generation algorithm Dr. Michael Shmoish Bioinformatics Knowledge Unit, Head The Lorry I. Lokey Interdisciplinary Center for Life Sciences and Engineering Technion Israel Institute of Technology

More information

CyKEGGParser User Manual

CyKEGGParser User Manual CyKEGGParser User Manual Table of Contents Introduction... 3 Development... 3 Citation... 3 License... 3 Getting started... 4 Pathway loading... 4 Laoding KEGG pathways from local KGML files... 4 Importing

More information

The quantitative analysis of interactions takes bioinformatics to the next higher dimension: we go from 1D to 2D with graph theory.

The quantitative analysis of interactions takes bioinformatics to the next higher dimension: we go from 1D to 2D with graph theory. 1 The human protein-protein interaction network of aging-associated genes. A total of 261 aging-associated genes were assembled using the GenAge Human Database. Protein-protein interactions of the human

More information

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology 9/9/ I9 Introduction to Bioinformatics, Clustering algorithms Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Outline Data mining tasks Predictive tasks vs descriptive tasks Example

More information

Evolutionary origins of modularity

Evolutionary origins of modularity Evolutionary origins of modularity Jeff Clune, Jean-Baptiste Mouret and Hod Lipson Proceedings of the Royal Society B 2013 Presented by Raghav Partha Evolvability Evolvability capacity to rapidly adapt

More information

Network visualization and analysis with Cytoscape. Based on slides by Gary Bader (U Toronto)

Network visualization and analysis with Cytoscape. Based on slides by Gary Bader (U Toronto) Network visualization and analysis with Cytoscape Based on slides by Gary Bader (U Toronto) Network Analysis Workflow Load Networks e.g. PPI data Import network data into Cytoscape Load Attributes e.g.

More information

Network Visualization: Cytoscape

Network Visualization: Cytoscape Network Visualization: Cytoscape Ritchie Lab Center for Systems Genomics Pennsylvania State University September 13, 2014 What is Cytoscape? Cytoscape is an open source software platform for visualizing

More information

SEEK User Manual. Introduction

SEEK User Manual. Introduction SEEK User Manual Introduction SEEK is a computational gene co-expression search engine. It utilizes a vast human gene expression compendium to deliver fast, integrative, cross-platform co-expression analyses.

More information

CONTENTS 1. Contents

CONTENTS 1. Contents BIANA Tutorial CONTENTS 1 Contents 1 Getting Started 6 1.1 Starting BIANA......................... 6 1.2 Creating a new BIANA Database................ 8 1.3 Parsing External Databases...................

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London Spring 2010 1. Motivation Large Networks model many real-world phenomena technological: www, internet,

More information

SciMiner User s Manual

SciMiner User s Manual SciMiner User s Manual Copyright 2008 Junguk Hur. All rights reserved. Bioinformatics Program University of Michigan Ann Arbor, MI 48109, USA Email: juhur@umich.edu Homepage: http://jdrf.neurology.med.umich.edu/sciminer/

More information

Networks in economics and finance. Lecture 1 - Measuring networks

Networks in economics and finance. Lecture 1 - Measuring networks Networks in economics and finance Lecture 1 - Measuring networks What are networks and why study them? A network is a set of items (nodes) connected by edges or links. Units (nodes) Individuals Firms Banks

More information

WebGestalt Manual. January 30, 2013

WebGestalt Manual. January 30, 2013 WebGestalt Manual January 30, 2013 The Web-based Gene Set Analysis Toolkit (WebGestalt) is a suite of tools for functional enrichment analysis in various biological contexts. WebGestalt compares a user

More information

Graph Data Management Systems in New Applications Domains. Mikko Halin

Graph Data Management Systems in New Applications Domains. Mikko Halin Graph Data Management Systems in New Applications Domains Mikko Halin Introduction Presentation is based on two papers Graph Data Management Systems for New Application Domains - Philippe Cudré-Mauroux,

More information

Unstructured Text in Big Data The Elephant in the Room

Unstructured Text in Big Data The Elephant in the Room Unstructured Text in Big Data The Elephant in the Room David Milward ICIC, October 2013 Click Unstructured to to edit edit Master Master Big title Data style title style Big Data Volume, Variety, Velocity

More information

Introduction to Systems Biology II: Lab

Introduction to Systems Biology II: Lab Introduction to Systems Biology II: Lab Amin Emad NIH BD2K KnowEnG Center of Excellence in Big Data Computing Carl R. Woese Institute for Genomic Biology Department of Computer Science University of Illinois

More information

Biological Networks Analysis Network Motifs. Genome 373 Genomic Informatics Elhanan Borenstein

Biological Networks Analysis Network Motifs. Genome 373 Genomic Informatics Elhanan Borenstein Biological Networks Analysis Network Motifs Genome 373 Genomic Informatics Elhanan Borenstein Networks: Networks vs. graphs A collection of nodes and links A quick review Directed/undirected; weighted/non-weighted,

More information

Various Graphs and Their Applications in Real World

Various Graphs and Their Applications in Real World Various Graphs and Their Applications in Real World Pranav Patel M. Tech. Computer Science and Engineering Chirag Patel M. Tech. Computer Science and Engineering Abstract This day s usage of computers

More information

Incoming, Outgoing Degree and Importance Analysis of Network Motifs

Incoming, Outgoing Degree and Importance Analysis of Network Motifs Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.758

More information

Lecture 5. Functional Analysis with Blast2GO Enriched functions. Kegg Pathway Analysis Functional Similarities B2G-Far. FatiGO Babelomics.

Lecture 5. Functional Analysis with Blast2GO Enriched functions. Kegg Pathway Analysis Functional Similarities B2G-Far. FatiGO Babelomics. Lecture 5 Functional Analysis with Blast2GO Enriched functions FatiGO Babelomics FatiScan Kegg Pathway Analysis Functional Similarities B2G-Far 1 Fisher's Exact Test One Gene List (A) The other list (B)

More information

FunRich Tool Documentation

FunRich Tool Documentation FunRich Tool Documentation Version 2.1.2 Shivakumar Keerthikumar, Mohashin Pathan, Johnson Agbinya and Suresh Mathivanan Mathivanan Lab http://www.mathivananlab.org La Trobe University LIMS1 Department

More information

Min Wang. April, 2003

Min Wang. April, 2003 Development of a co-regulated gene expression analysis tool (CREAT) By Min Wang April, 2003 Project Documentation Description of CREAT CREAT (coordinated regulatory element analysis tool) are developed

More information

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows) Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows) Average clustering coefficient of a graph Overall measure

More information

Some Graph Theory for Network Analysis. CS 249B: Science of Networks Week 01: Thursday, 01/31/08 Daniel Bilar Wellesley College Spring 2008

Some Graph Theory for Network Analysis. CS 249B: Science of Networks Week 01: Thursday, 01/31/08 Daniel Bilar Wellesley College Spring 2008 Some Graph Theory for Network Analysis CS 9B: Science of Networks Week 0: Thursday, 0//08 Daniel Bilar Wellesley College Spring 008 Goals this lecture Introduce you to some jargon what we call things in

More information

Blast2GO Teaching Exercises

Blast2GO Teaching Exercises Blast2GO Teaching Exercises Ana Conesa and Stefan Götz 2012 BioBam Bioinformatics S.L. Valencia, Spain Contents 1 Annotate 10 sequences with Blast2GO 2 2 Perform a complete annotation process with Blast2GO

More information

Predicting Disease-related Genes using Integrated Biomedical Networks

Predicting Disease-related Genes using Integrated Biomedical Networks Predicting Disease-related Genes using Integrated Biomedical Networks Jiajie Peng (jiajiepeng@nwpu.edu.cn) HanshengXue(xhs1892@gmail.com) Jin Chen* (chen.jin@uky.edu) Yadong Wang* (ydwang@hit.edu.cn) 1

More information

SDCSB Cytoscape Workshop 12/4/2012 Keiichiro Ono

SDCSB Cytoscape Workshop 12/4/2012 Keiichiro Ono Cytoscape Basic Tutorial SDCSB Cytoscape Workshop 12/4/2012 Keiichiro Ono Navigating Cytoscape Navigating Cytoscape This section will introduce the Cytoscape user interface. First of all we will look at

More information

Special-topic lecture bioinformatics: Mathematics of Biological Networks

Special-topic lecture bioinformatics: Mathematics of Biological Networks Special-topic lecture bioinformatics: Leistungspunkte/Credit points: 5 (V2/Ü1) This course is taught in English language. The material (from books and original literature) are provided online at the course

More information

Foundations of Machine Learning CentraleSupélec Fall Clustering Chloé-Agathe Azencot

Foundations of Machine Learning CentraleSupélec Fall Clustering Chloé-Agathe Azencot Foundations of Machine Learning CentraleSupélec Fall 2017 12. Clustering Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr Learning objectives

More information

VisANT 4.0 User Manual. Contents

VisANT 4.0 User Manual. Contents VisANT 4.0 User Manual Contents Visualization of Disease, Therapy and GO Hierarchie s Using Hierarchy Explorer... 2 Navigation of Hierarchies... 2 Searching the Hierarchy... 3 Searching Terms Using Key

More information

Tutorial:OverRepresentation - OpenTutorials

Tutorial:OverRepresentation - OpenTutorials Tutorial:OverRepresentation From OpenTutorials Slideshow OverRepresentation (about 12 minutes) (http://opentutorials.rbvi.ucsf.edu/index.php?title=tutorial:overrepresentation& ce_slide=true&ce_style=cytoscape)

More information

Clustering analysis of gene expression data

Clustering analysis of gene expression data Clustering analysis of gene expression data Chapter 11 in Jonathan Pevsner, Bioinformatics and Functional Genomics, 3 rd edition (Chapter 9 in 2 nd edition) Human T cell expression data The matrix contains

More information

Genetic Programming. Charles Chilaka. Department of Computational Science Memorial University of Newfoundland

Genetic Programming. Charles Chilaka. Department of Computational Science Memorial University of Newfoundland Genetic Programming Charles Chilaka Department of Computational Science Memorial University of Newfoundland Class Project for Bio 4241 March 27, 2014 Charles Chilaka (MUN) Genetic algorithms and programming

More information

Pathway Studio Quick Start Guide

Pathway Studio Quick Start Guide Pathway Studio Quick Start Guide This Quick Start Guide is for users of the Pathway Studio 4.0 pathway analysis software. The Quick Start Guide demonstrates the key features of the software and provides

More information

TieDIE Tutorial. Version 1.0. Evan Paull

TieDIE Tutorial. Version 1.0. Evan Paull TieDIE Tutorial Version 1.0 Evan Paull June 9, 2013 Contents A Signaling Pathway Example 2 Introduction............................................ 2 TieDIE Input Format......................................

More information

Tutorial. RNA-Seq Analysis of Breast Cancer Data. Sample to Insight. November 21, 2017

Tutorial. RNA-Seq Analysis of Breast Cancer Data. Sample to Insight. November 21, 2017 RNA-Seq Analysis of Breast Cancer Data November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com

More information

Approximate Graph Patterns for Biological Network

Approximate Graph Patterns for Biological Network Approximate Graph Patterns for Biological Network 1) Conserved patterns of protein interaction in multiple species - Proceedings of the National Academy of Science (PNAS) - 2003 2) Conserved pathways within

More information

User guide: magnum (v1.0)

User guide: magnum (v1.0) User guide: magnum (v1.0) Daniel Marbach October 5, 2015 Table of contents 1. Synopsis. 2 2. Introduction. 3 3. Step-by-step tutorial.. 4 3.1. Connectivity enrichment anlaysis... 4 3.2. Loading settings

More information

Special course in Computer Science: Advanced Text Algorithms

Special course in Computer Science: Advanced Text Algorithms Special course in Computer Science: Advanced Text Algorithms Lecture 8: Multiple alignments Elena Czeizler and Ion Petre Department of IT, Abo Akademi Computational Biomodelling Laboratory http://www.users.abo.fi/ipetre/textalg

More information

GraphLinker. A Visual Comparative Environment of Genomic and Metabolic Networks. Project Update. Niels Hanson

GraphLinker. A Visual Comparative Environment of Genomic and Metabolic Networks. Project Update. Niels Hanson GraphLinker A Visual Comparative Environment of Genomic and Metabolic Networks Project Update Niels Hanson Monday, November 21 2011 Microbacterial Communities diversity and dynamics largely unexplored

More information

Graphs Eulerian and Hamiltonian Applications Graph layout software. Graphs. SET07106 Mathematics for Software Engineering

Graphs Eulerian and Hamiltonian Applications Graph layout software. Graphs. SET07106 Mathematics for Software Engineering Graphs SET76 Mathematics for Software Engineering School of Computing Edinburgh Napier University Module Leader: Uta Priss 2 Copyright Edinburgh Napier University Graphs Slide /3 Outline Graphs Eulerian

More information

Introduction to GE Microarray data analysis Practical Course MolBio 2012

Introduction to GE Microarray data analysis Practical Course MolBio 2012 Introduction to GE Microarray data analysis Practical Course MolBio 2012 Claudia Pommerenke Nov-2012 Transkriptomanalyselabor TAL Microarray and Deep Sequencing Core Facility Göttingen University Medical

More information

Database and Knowledge-Base Systems: Data Mining. Martin Ester

Database and Knowledge-Base Systems: Data Mining. Martin Ester Database and Knowledge-Base Systems: Data Mining Martin Ester Simon Fraser University School of Computing Science Graduate Course Spring 2006 CMPT 843, SFU, Martin Ester, 1-06 1 Introduction [Fayyad, Piatetsky-Shapiro

More information

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다.

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다. 저작자표시 - 비영리 - 변경금지 2.0 대한민국 이용자는아래의조건을따르는경우에한하여자유롭게 이저작물을복제, 배포, 전송, 전시, 공연및방송할수있습니다. 다음과같은조건을따라야합니다 : 저작자표시. 귀하는원저작자를표시하여야합니다. 비영리. 귀하는이저작물을영리목적으로이용할수없습니다. 변경금지. 귀하는이저작물을개작, 변형또는가공할수없습니다. 귀하는, 이저작물의재이용이나배포의경우,

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Department of Computer Science & Engineering University of Kalyani. Syllabus for Ph.D. Coursework

Department of Computer Science & Engineering University of Kalyani. Syllabus for Ph.D. Coursework Department of Computer Science & Engineering University of Kalyani Syllabus for Ph.D. Coursework Paper 1: A) Literature Review: (Marks - 25) B) Research Methodology: (Marks - 25) Paper 2: Computer Applications:

More information

CAIM: Cerca i Anàlisi d Informació Massiva

CAIM: Cerca i Anàlisi d Informació Massiva 1 / 72 CAIM: Cerca i Anàlisi d Informació Massiva FIB, Grau en Enginyeria Informàtica Slides by Marta Arias, José Balcázar, Ricard Gavaldá Department of Computer Science, UPC Fall 2016 http://www.cs.upc.edu/~caim

More information

Integrated Analysis of Gene Expression and Other Data

Integrated Analysis of Gene Expression and Other Data Analysis of DNA Chips and Gene Networks Fall Semester, 2010 Lecture 14: January 21, 2010 Lecturer: Prof. Ron Shamir Scribe: David Ze evi Integrated Analysis of Gene Expression and Other Data 14.1 Introduction

More information

Data Mining in Bioinformatics: Study & Survey

Data Mining in Bioinformatics: Study & Survey Data Mining in Bioinformatics: Study & Survey Saliha V S St. Joseph s college Irinjalakuda Abstract--Large amounts of data are generated in medical research. A biological database consists of a collection

More information

An overview of Graph Categories and Graph Primitives

An overview of Graph Categories and Graph Primitives An overview of Graph Categories and Graph Primitives Dino Ienco (dino.ienco@irstea.fr) https://sites.google.com/site/dinoienco/ Topics I m interested in: Graph Database and Graph Data Mining Social Network

More information

GRAPH DB S & APPLICATIONS

GRAPH DB S & APPLICATIONS GRAPH DB S & APPLICATIONS DENIS VRDOLJAK GUNNAR KLEEMANN UC BERKELEY SCHOOL OF INFORMATION BERKELEY DATA SCIENCE GROUP, LLC PRESENTATION ROAD MAP Intro Background Examples Our Work Graph Databases Intro

More information

Theme Identification in RDF Graphs

Theme Identification in RDF Graphs Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published

More information

The Betweenness Centrality Of Biological Networks

The Betweenness Centrality Of Biological Networks The Betweenness Centrality Of Biological Networks Shivaram Narayanan Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements

More information

Comparison of Centralities for Biological Networks

Comparison of Centralities for Biological Networks Comparison of Centralities for Biological Networks Dirk Koschützki and Falk Schreiber Bioinformatics Center Gatersleben-Halle Institute of Plant Genetics and Crop Plant Research Corrensstraße 3 06466 Gatersleben,

More information

Mathematics of Networks II

Mathematics of Networks II Mathematics of Networks II 26.10.2016 1 / 30 Definition of a network Our definition (Newman): A network (graph) is a collection of vertices (nodes) joined by edges (links). More precise definition (Bollobàs):

More information

Metasearch Process for Transcription Targets

Metasearch Process for Transcription Targets Step 1 Select 'Genes' This is the primary interface for the Metasearch add on to Thomson Reuters (GeneGO) platform. Metasearch allows one to make complex queries for information extraction. This document

More information

Supplementary Materials for. A gene ontology inferred from molecular networks

Supplementary Materials for. A gene ontology inferred from molecular networks Supplementary Materials for A gene ontology inferred from molecular networks Janusz Dutkowski, Michael Kramer, Michal A Surma, Rama Balakrishnan, J Michael Cherry, Nevan J Krogan & Trey Ideker 1. Supplementary

More information

Drug versus Disease (DrugVsDisease) package

Drug versus Disease (DrugVsDisease) package 1 Introduction Drug versus Disease (DrugVsDisease) package The Drug versus Disease (DrugVsDisease) package provides a pipeline for the comparison of drug and disease gene expression profiles where negatively

More information

Network generation and analysis. through Cytoscape and PSICQUIC

Network generation and analysis. through Cytoscape and PSICQUIC (v6, 6/6/13) Network generation and analysis through Cytoscape and PSICQUIC Author: Pablo Porras Millán IntAct Scientific Database Curator This work is licensed under the Creative Commons Attribution-Share

More information

OPEN MP-BASED PARALLEL AND SCALABLE GENETIC SEQUENCE ALIGNMENT

OPEN MP-BASED PARALLEL AND SCALABLE GENETIC SEQUENCE ALIGNMENT OPEN MP-BASED PARALLEL AND SCALABLE GENETIC SEQUENCE ALIGNMENT Asif Ali Khan*, Laiq Hassan*, Salim Ullah* ABSTRACT: In bioinformatics, sequence alignment is a common and insistent task. Biologists align

More information

High throughput Data Analysis 2. Cluster Analysis

High throughput Data Analysis 2. Cluster Analysis High throughput Data Analysis 2 Cluster Analysis Overview Why clustering? Hierarchical clustering K means clustering Issues with above two Other methods Quality of clustering results Introduction WHY DO

More information

EGAN Tutorial: A Basic Use-case

EGAN Tutorial: A Basic Use-case EGAN Tutorial: A Basic Use-case July 2010 Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center University of California, San Francisco (AKA BCBC HDFCCC

More information

Structural Bioinformatics

Structural Bioinformatics Structural Bioinformatics Elucidation of the 3D structures of biomolecules. Analysis and comparison of biomolecular structures. Prediction of biomolecular recognition. Handles three-dimensional (3-D) structures.

More information

NetWalker Genomic Data Integration Platform. User Guide

NetWalker Genomic Data Integration Platform. User Guide NetWalker Genomic Data Integration Platform User Guide Table of Contents NetWalker Genomic Data Integration Platform... 0 General Object Structure and software layout... 1 1. NetWalker Interactome Knowledgebase...

More information

EECS730: Introduction to Bioinformatics

EECS730: Introduction to Bioinformatics EECS730: Introduction to Bioinformatics Lecture 15: Microarray clustering http://compbio.pbworks.com/f/wood2.gif Some slides were adapted from Dr. Shaojie Zhang (University of Central Florida) Microarray

More information

Logic: The Big Picture. Axiomatizing Arithmetic. Tautologies and Valid Arguments. Graphs and Trees

Logic: The Big Picture. Axiomatizing Arithmetic. Tautologies and Valid Arguments. Graphs and Trees Axiomatizing Arithmetic Logic: The Big Picture Suppose we restrict the domain to the natural numbers, and allow only the standard symbols of arithmetic (+,, =, >, 0, 1). Typical true formulas include:

More information

Transfer String Kernel for Cross-Context Sequence Specific DNA-Protein Binding Prediction. by Ritambhara Singh IIIT-Delhi June 10, 2016

Transfer String Kernel for Cross-Context Sequence Specific DNA-Protein Binding Prediction. by Ritambhara Singh IIIT-Delhi June 10, 2016 Transfer String Kernel for Cross-Context Sequence Specific DNA-Protein Binding Prediction by Ritambhara Singh IIIT-Delhi June 10, 2016 1 Biology in a Slide DNA RNA PROTEIN CELL ORGANISM 2 DNA and Diseases

More information

Centralities for undirected graphs

Centralities for undirected graphs Chapter 4 Centralities for undirected graphs The next step in network analysis is to define, using mathematical formalisms, the different features we want to compute. So we present indexes, called centralities,

More information

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services Patrick Wendel Imperial College, London Data Mining and Exploration Middleware for Distributed and Grid Computing,

More information

Advanced UCSC Browser Functions

Advanced UCSC Browser Functions Advanced UCSC Browser Functions Dr. Thomas Randall tarandal@email.unc.edu bioinformatics.unc.edu UCSC Browser: genome.ucsc.edu Overview Custom Tracks adding your own datasets Utilities custom tools for

More information

V10 Metabolic networks - Graph connectivity

V10 Metabolic networks - Graph connectivity V10 Metabolic networks - Graph connectivity Graph connectivity is related to analyzing biological networks for - finding cliques - edge betweenness - modular decomposition that have been or will be covered

More information

Case Studies in Complex Networks

Case Studies in Complex Networks Case Studies in Complex Networks Introduction to Scientific Modeling CS 365 George Bezerra 08/27/2012 The origin of graph theory Königsberg bridge problem Leonard Euler (1707-1783) The Königsberg Bridge

More information

/ Computational Genomics. Normalization

/ Computational Genomics. Normalization 10-810 /02-710 Computational Genomics Normalization Genes and Gene Expression Technology Display of Expression Information Yeast cell cycle expression Experiments (over time) baseline expression program

More information

Introduction III. Graphs. Motivations I. Introduction IV

Introduction III. Graphs. Motivations I. Introduction IV Introduction I Graphs Computer Science & Engineering 235: Discrete Mathematics Christopher M. Bourke cbourke@cse.unl.edu Graph theory was introduced in the 18th century by Leonhard Euler via the Königsberg

More information

8/19/13. Computational problems. Introduction to Algorithm

8/19/13. Computational problems. Introduction to Algorithm I519, Introduction to Introduction to Algorithm Yuzhen Ye (yye@indiana.edu) School of Informatics and Computing, IUB Computational problems A computational problem specifies an input-output relationship

More information

The genexplain platform. Workshop SW2: Pathway Analysis in Transcriptomics, Proteomics and Metabolomics

The genexplain platform. Workshop SW2: Pathway Analysis in Transcriptomics, Proteomics and Metabolomics The genexplain platform Workshop SW2: Pathway Analysis in Transcriptomics, Proteomics and Metabolomics Saturday, March 17, 2012 2 genexplain GmbH Am Exer 10b D-38302 Wolfenbüttel Germany E-mail: olga.kel-margoulis@genexplain.com,

More information

Tutorial:Introduction to Cytoscape

Tutorial:Introduction to Cytoscape Tutorial:Introduction to Cytoscape 1 Tutorial:Introduction to Cytoscape Cytoscape is an open source software platform for integrating, visualizing, and analyzing measurement data in the context of networks.

More information

2. Take a few minutes to look around the site. The goal is to familiarize yourself with a few key components of the NCBI.

2. Take a few minutes to look around the site. The goal is to familiarize yourself with a few key components of the NCBI. 2 Navigating the NCBI Instructions Aim: To become familiar with the resources available at the National Center for Bioinformatics (NCBI) and the search engine Entrez. Instructions: Write the answers to

More information

Network Based Models For Analysis of SNPs Yalta Opt

Network Based Models For Analysis of SNPs Yalta Opt Outline Network Based Models For Analysis of Yalta Optimization Conference 2010 Network Science Zeynep Ertem*, Sergiy Butenko*, Clare Gill** *Department of Industrial and Systems Engineering, **Department

More information

Social Network Analysis With igraph & R. Ofrit Lesser December 11 th, 2014

Social Network Analysis With igraph & R. Ofrit Lesser December 11 th, 2014 Social Network Analysis With igraph & R Ofrit Lesser ofrit.lesser@gmail.com December 11 th, 2014 Outline The igraph R package Basic graph concepts What can you do with igraph? Construction Attributes Centrality

More information

What can we do today that we couldn t do before.

What can we do today that we couldn t do before. BIG DATA DISCOVERY SCIENCE Big Data for Discovery Science (BDDS) What can we do today that we couldn t do before. Arthur Toga, PI Sept. 30, 2016 Santa Rosa, CA 10/6/2016 BIG DATA DISCOVERY SCIENCE 2 BIG

More information

Tutorial for the Exon Ontology website

Tutorial for the Exon Ontology website Tutorial for the Exon Ontology website Table of content Outline Step-by-step Guide 1. Preparation of the test-list 2. First analysis step (without statistical analysis) 2.1. The output page is composed

More information

Boolean regulatory network reconstruction using literature based knowledge with a genetic algorithm optimization method

Boolean regulatory network reconstruction using literature based knowledge with a genetic algorithm optimization method Dorier et al. BMC Bioinformatics (2016) 17:410 DOI 10.1186/s12859-016-1287-z METHODOLOGY ARTICLE Open Access Boolean regulatory network reconstruction using literature based knowledge with a genetic algorithm

More information

NSGA-II for Biological Graph Compression

NSGA-II for Biological Graph Compression Advanced Studies in Biology, Vol. 9, 2017, no. 1, 1-7 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/asb.2017.61143 NSGA-II for Biological Graph Compression A. N. Zakirov and J. A. Brown Innopolis

More information