Professional Documents
Culture Documents
(Bio)Chemistry Concentrations Molecular structures Reaction equations Quantitative Defined experimental setup
Molecular Biology Regulation Large biomolecules Large scale processes Qualitative Complex experimental setup (by necessity!)
Human Genetics
Capturing complexity
integration graphs
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Cytoscape Workflow
1. Load Networks (Import network data into Cytoscape) 2. Load Attributes (Get data about networks into Cytoscape) 3. Analyze and Visualize Networks 4. Prepare for Publication A specific example of this workflow:
Cline, et al. Integration of biological networks and gene expression data using Cytoscape, Nature Protocols, 2, 2366-2382 (2007).
Human Genetics
Networks as graphs
A Network is a collection of
Nodes (or vertices) Edges connecting nodes (directed or undirected, weighted, multiple edges, self-edges) Nodes can represent proteins, genes, metabolites, or groups of these (e.g. complexes) - any sort of object Edges can be either physical or functional interactions, activators, regulators, reactions - any sort of relations
Human Genetics
Cytoscape Workflow
1. Load Networks (Get network data into Cytoscape) 2. Load Attributes (Get data about networks into Cytoscape) 3. Analyze and Visualize Networks 4. Prepare for Publication
Human Genetics
Creating a network
Human Genetics
Preview
Human Genetics
Functional interactions
Co-expression relations Genetic interactions Knockout/siRNA targets
Human Genetics
Available for download from data sources (URLs, web-services, formatted table files)
Human Genetics
Internet Databases
Cytoscape version 2.6
web service clients: import networks directly from several trusted internet resources IntAct (MBL-EBI) PathwayCommons (collection of data resources) NCBI Entrez Gene Many more will be included...
Human Genetics
Human Genetics
Cytoscape Workflow
1. Load Networks (Get network data into Cytoscape) 2. Load Attributes (Get data about networks into Cytoscape) 3. Analyze and Visualize Networks 4. Prepare for Publication
Human Genetics
Attribute Management
Select Attributes for Display
Node or Edge ID
ID Mapping
Mapping identifiers from one source to another is a major challenge Multiple levels of IDs E.g. probe>gene ->peptide->protein Cytoscape provides an ID mapping through the BioMart web service of EBI to convert the IDs Not perfect but sufficient Additional mapping mechanism underway
Human Genetics
Cytoscape Workflow
1. Load Networks (Get network data into Cytoscape) 2. Load Attributes (Get data about networks into Cytoscape) 3. Analyze and Visualize Networks 4. Prepare for Publication
Human Genetics
VizMapper
2. Attribute Data
ExpressionValue YCL040W = 0.542 YDL130W = -0.123 YDR382W = -0.058 YFL039C = 0.192 YHR179W = 0.078
Human Genetics
VizMapper
List of Visual Styles Default Visual Style Editor List of Visual Attributes
Mapping definition
Human Genetics
Types of mappings
Continuous Continuous Data mapped to Continuous Visual Attributes (e.g. gene expression levels mapped to node color) Continuous Data mapped to Discrete Visual Attributes (e.g. p-value categories mapped to node shape) Discrete Discrete (categorical) Data to Discrete Visual Attributes (e.g. GO annotation mapped to node shape) Discrete Data mapped to Continuous Visual Attributes(e.g. multiple GO terms mapped to pie coloring)
Human Genetics
Network Filtering
Human Genetics
Spring-embedded
Circular Hierarchical
Human Genetics
Linkout
Nodes and Edges act as hyperlinks to external databases. Userconfigurable URLs Collection of the biological results
Human Genetics
Cytoscape Workflow
1. Load Networks (Get network data into Cytoscape) 2. Load Attributes (Get data about networks into Cytoscape) 3. Analyze and Visualize Networks 4. Prepare for Publication
Human Genetics
Human Genetics
Cytoscape: So what?
The big Pro Cyto argument: EXTENSIBLE Plugins, Plugins, Plugins
In our case enabled extended array data analysis
Human Genetics
Cytoscape is Extensible
Cytoscape is open source and free software A plugin interface that allows any programmer to write their own extensions to Cytoscape Plugins represent the primary biological analysis mechanism in Cytoscape Plugins are distributed from a central Cytoscape database and can be installed while running
Human Genetics
http://cytoscape.org/cgi-bin/moin.cgi/Hello_World_Plugin http://cytoscape.org/cgi-bin/moin.cgi/Developer_Homepage
Human Genetics
Human Genetics
DB
Cytoscape webstart
AMC Plugin
Cytoscape interface
Canonical paths
R2 interface: Demo
Human Genetics
R2 interface
Human Genetics
R2 interface
Human Genetics
R2 interface
Human Genetics
R2 interface
Human Genetics
R2 interface
Human Genetics
Human Genetics
Timeseries in R2
Human Genetics
Timeseries in R2
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Timeseries in Cytoscape
Human Genetics
Timeseries in Cytoscape
Human Genetics
Human Genetics
Filtering
Human Genetics
Filtering
Human Genetics
Coloring, layout
Human Genetics
Resuming:
1. Aggregate 2. Search/Visualize 3. Analyze/ Feedback Combine NOTCH3 knockout data with TF and PPi data Layout timeseries/Find downstream targets Identify MSX1/Knockout in new experiment
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Human Genetics
Default settings for the class attractive force and separation factor Class-directed network layout
Human Genetics
Network clusters are highly interconnected sub-networks that may be also partly overlapping Clusters in a protein-protein interaction network have been shown to represent protein complexes and parts of biological pathways Clusters in a protein similarity network represent protein families
Human Genetics
Network Clustering
7000 Yeast interactions among 3000 proteins
Human Genetics
Human Genetics
Proteasome 26S
Ribosome
Proteasome 20S
Human Genetics
Network motif is a sub-network that occurs significantly more often than by chance alone Input: query and target networks, optional node/edge labels Output: topological query matches as subgraphs of target network Supports: subgraph matching, node/edge labels, label wildcards, approximate paths http://alpha.dmi.unict.it/~ctnyu/netmatch.html
Human Genetics
Query
Ferro et al. Bioinformatics 2007
Human Genetics
Results
NetMatch Results
Signaling pathway example NetMatch query
Ras
Raf-1
Shortest path between subgraph matches
Human Genetics
Mek MAPK
Method
Calculate z-score/node, ZA score/subgraph, correct for random expression data sampling Score over multiple experimental conditions Simulated annealing-based search method is used to find the high scoring networks
Ideker T, Ozier O, Schwikowski B, Siegel AF Human Genetics Bioinformatics. 2002;18 Suppl 1:S233-40
Output: significant subnetworks that show differential expression over one or several conditions
Human Genetics
Human Genetics
Concluding
Cytoscape is a proven valuable tool for integrative bioinformatics Easily extensible: well suited to answer new biological research questions Analyses can be tedious for biologists; up to bioinformaticians to translate these in simple workflows Therefore: bioinformaticians, integrate into wet-lab research groups!
Human Genetics
Some notes
Plugin lifetime
Maintenance Interoperability
Visualization issues
Standard biologist layouts Fancy visuals
Availability
Cytoscape:
http://cytoscape.org cytoscape-discuss@googlegroups.com cytoscape-helpdesk@googlegroups.com
R2
Available shortly through http://humangeneticsamc.nl Keep yourself posted on http://groups.google.com/group/r2-announce Human Genetics