Oct 21 2010 Feedback.
Contents
 
Introduction
How To Guide
Getting Started
Molecular Graphics
Slides and Documents
ActiveICM
Movie Making
Sequences & Alignments
Protein Structure Analysis
Protein Superposition
Crystallographic Analysis
Homology & Modeling
3D Predict
Molecular Mechanics
Cheminformatics
Chemsitry Menu
Docking
Ligand Editor
Tables
 Standard
 Molecular Tables
 Plot
 PCA
 PLS and Principal Component regression and classification
 Cluster
  Tree Select
  Save and Print
  Tree View
Local Databases
FAQs
Tutorials
 
Index
PrevICM User's Guide
19.6 Cluster
Next

[ Tree Select | Save and Print | Tree View ]

To perform clustering based on chemical similarity or any other data you must have an SDF file or table loaded into ICM.

  • Right click on the table and select the Clustering option OR select the Chemistry menu and choose the option entitled Cluster Set.
OR
  • Tools/Table/Clustering.

A data entry box as shown below will be displayed.

  • Select which table from the drop down menu you wish to cluster.

There are two clustering methods:

  1. A rigorous tree approach (advice to use this with 10000 compounds or less). This clustering algorithm consists of 2 steps: calculation of distance matrix (based on chemical fingerprints for chemical data) and the hieararchical clustering itself. Usually most time is being spent on first step.
  2. A less rigorous K-means approach. This option is quicker but the generated tree is not detalized down to the level of table rows. The elements within the table are colored and numbered according to their clustered group.

If your computer has enough memory we recomend you use the TREE method. It takes ~6 minutes to cluster 10000 compounds on a standard computer with 512Mb of memory.

NOTE If you wish to cluster a selection of a table; first select the compounds you wish to cluster (Ctrl A will select all or see the section of the manual entitled making table selections.

When using the TREE method select which linkage type you wish to use:

  • UPGMA (unweighted pair group method using averages): Distance calculated is the average of all elements (recommended).
  • Single linkage: Nearest neighbour linkage
  • Complete linkage: Furthest neighbour linkage
  • WPGMA (weighted pair group method using averages): Rough approximation of weighted (slightly faster)

Check the option "Keep Distance Matrix" if you wish the Distance Matrix to be saved.

  • Click the RUN button and the tree and table will be displayed as shown below.

19.6.1 Tree Selection


A number of different selections can be made once the tree has been created.

  • Click in free area unselects all.
  • Click on node selects node and all its heirs.
  • Ctrl+click adds to (if not selected yet) or removes from (if already in selection) current selection.
  • Shift plus click allows a range to be selected.
  • Click in label zone selects node.
  • Keyboard "Up"/"Down" cursor keys move selected node up/down in the tree. With Shift held the selection expands in the according direction.
  • Keyboard "Escape" unselects all.

Select Center Representatives From a Node.

To select the "center" representative compound or value from a cluster.

  • Right click in the free area of the tree and select the option "Select Centers".

NOTE: Selections in the tree will be highlighted in blue in the tree and in the table.

Copy selection to new table.

All selections can be copied to a new table by:

  • Right clicking on the selected rows in table and a menu will be displayed.
  • Selecting the "copy selection to ICM table" option.

19.6.2 Save and Print Tree


To save a tree

Option 1:

Option 2:

  • Save the table as an .icb file. Right click on table header and select "save as".

To save a tree as a picture

  • Right click in the "free area".
  • Select "Save Image..."

The image of the tree can also be saved to the clipboard.

  • Right click in the "free area".
  • Select "Copy Image to Clipboard"

To print the tree

  • Right click in the "free area".
  • Select "Print"

19.6.3 Tree View


A number of changes to the tree view can be made by right clicking in the tree "free area" and selecting preferences as shown below.

A preferences window as shown below will be displayed.

Reverse Growth:

Distance Range:

Color Preferences:

Left: Fill Right: Color Lines

Left: Color lines to parent Right: Only labels

Increase or Decrease Line Spacing:

Increase or Decrease Line Width:

Change Font and Margins:

Change Label

To change the tree labeling (GLOBAL):

* Right click in the "free area" and select the Label/Sort option.

A window as shown below will be displayed:

  • Choose the column name you wish to label the elements of your tree.
  • Click the Append to Format button. You may wish to delete some of the information in the Format data entry box if you do not wish that column header to be displayed. Note more than one column header can be used as a label.
  • Click the update labels button to view the changes.

To change the tree labeling (NODE):

  • Right click on the node you wish to change the label of and select "Edit Label" and a data entry box as shown below will be displayed.

  • Enter the new label.

Change Node Coloring

To change the color of a node:

  • Right click on the node you wish to change the color of and select "Edit Color".
  • Select the desired color and click OK.


Prev
Clustering
Home
Up
Next
Standard

Copyright© 1989-2010, Molsoft,LLC - All Rights Reserved.
This document contains proprietary and confidential information of Molsoft, LLC.
The content of this document may not be disclosed to third parties, copied or duplicated in any form,
in whole or in part, without the prior written permission from Molsoft, LLC.