Abs

# https://www.epa.gov/sites/production/files/2015-05/documents/moleculardescriptorsguide-v102.pdf

array.

Count ( I|R|S_array unique | identity ) → I

returns an integer array with integer id for sequentially identical values. Example:


group table t {"d","d","d","bb","bb","a","a","a"} 
add column t Count(t.A unique ) Count(t.A identity ) name={ "unique","identity" }
show t
 #>T t
 #>-A-----------unique------identity---
    d           1           1          
    d           1           2          
    d           1           3          
    bb          2           1          
    bb          2           2          
    a           3           1          
    a           3           2          
    a           3           3

CubicRoot

CubicRoot( r ) → r_cubic_root

CubicRoot( r [ r_im ] ) → R6_3re+3im

Example:


CubicRoot(27. )
  3.
CubicRoot(27. 0.) 
 #>R 
   3.
   -1.5
   -1.5
   0.
   -2.598076
   2.598076

Deletion

Deletion ( rs_Fragment, ali_Alignment [, seq_fromAli ] [, i_addFlanks ] [{"all"|"nter"|"cter"|"loop"}] )

residue selection

ali_Alignment.

seq_fromAli

rs_Fragment

seq_fromAli

linked

Insertion

indels

i_addFlanks

"all" (default: no string option) select deletions of all types
"nter" select only N-terminal fragments
"cter" select only C-terminal fragments
"loop" select only the internal zones of deleted loops

Insertion

Descriptor (topological and numericals)

Descriptor ( chemArray number )

- returns table of various numerical descriptors of the following categories:

Atom counts
- a_count - total atom count
- a_heavy - heavy atom count
- a_aro - aromatic atom count
- HB_a - hydrogen bond acceptor count
- HB_don - hydrogen bond donor count
- Individual atom counts
  - a_nH a_nC a_nN a_nO a_nF a_nP a_nS a_nCl a_nBr
Bond counts
- b_count - total number of bonds
- b_heavy - number of bonds between heavy atoms
- b_rotN - number of non-ring bonds
- b_rotR - fraction of non-ring bonds
- b_1rotN - number of single non-ring bonds
- b_1rotR - fraction of single non-ring bonds
- b_ar - number of aromatic bonds
- b_single - number of single bonds
- b_double - number of double bonds
- b_triple - number of triple bonds
- b_rot - number of aromatic bonds
Topological Descriptors ([Hall 1991] and [Hall 1997])
- Definitions
  - n[i] : atomic number of atom i
  - d[i] : number of heavy connections for atom i
  - v[i] : nV[i] - nH[i] for atoms with atomic number <= 10, (nV[i] - nH[i]) / (n[i] - nV[i] - 1) for atoms with atomic number > 10
  - P[i] : number of paths of bond length i in hydrogen suppressed molecule
  - N : number of non-hydrogen atoms
  - R[i] : van der Waals radius of atom i
  - Rc : van der Waals radius of carbon atom
  - α : 1 - Sum( R[i] / Rc )
  - Na : N + α
  - Pa[i] : P[i] + α
- Chi Connectivity Indices
  - t_chi0 : Atomic connectivity index order 0. Sum(1/sqrt(d[i]))
  - t_chi0_C : Carbon connectivity index order 0. (Same as above calculated for carbon atoms)
  - t_chi1 : Atomic connectivity index order 1. Sum(1/sqrt(d[i]*d[j])) for all bonds between heavy atoms i and j.
  - t_chi1_C : Carbon connectivity index order 1. (Same as above calculated for carbon atoms)
  - t_chi0v : Atomic valence connectivity index order 0. Sum(1/sqrt(v[i]))
  - t_chi0v_C : Carbon valence connectivity index order 0. (Same as above calculated for carbon atoms)
  - t_chi1v : Atomic valence connectivity index order 1. Sum(1/sqrt(v[i]*v[j])) for all bonds between heavy atoms i and j.
  - t_chi1v_C : Atomic valence connectivity index order 1. (Same as above calculated for carbon atoms)
- Kappa Shape Indices
  - k1 : First kappa shape index: N(N-1)^2 / P[1]^2
  - k2 : Second kappa shape index: (N-1)(N-2)^2 / P[2]^2
  - k3 : Third kappa shape index: (N-1)(N-3)^2 / P[3]^3 for odd N, (N-3)(N-2)^2 / P[3]^2 for even N
  - k1a : First kappa shape index α: Na(Na-1)^2 / Pa[1]^2
  - k2a : Second kappa shape index: (Na-1)(Na-2)^2 / Pa[2]^2
  - k3a : Third kappa shape index: (Na-1)(Na-3)^2 / Pa[3]^3 for odd N, (Na-3)(Na-2)^2 / Pa[3]^2 for even N
- zagreb : Sum(d[i]*d[i]) for all non-hydrogen atoms
- weiner : Sum(D(i,j)) for all non-hydrogen atom pairs i,~~j. D(i,j) - topological distance between atoms i and j.
- maxdist : Max(D(i,j)) for all atom pairs i,~~j. D(i,j) - topological distance between atoms i and j.

Example:


read table s_icmhome + "celebrex50.sdf" name="t"
add column t Descriptor( t.mol number )

Descriptor

Descriptor ( chemArray )

Descriptor ( chemArray collection_of_Fingerprint_Parameters [info] )

- returns vector of binary fingerprints, default or custom, calculated for each chemical.

The collection_of_Fingerprint_Parameters argument is a collection which defines parameters for fingerprint generation and consists of the following members:

ATMAP: string with comma separate atom properties descriptions. Example: cd,h
BOMAP: string with comma separate bond properties description. Example: bt,r
SIZE : result vector size
LEN : maximum fragment/chain length
TYPE : fingerprint type: "linear","triplets","ecfp"
BINARY: yes/no (no - counted fingerprint mode)
ECFPITER: (ecfp only) number of iterations. 1 - ECFP2, 2 - ECFP3, 3 - ECFP4, etc...

Examples:


  # export default binary fingerprints
  write sarray Sarray(Descriptor(t.mol)) name="fp.txt"


# linear counted fingerprint, SIZE=1024, max chain length=5 
Descriptor( t.mol, Collection("ATMAP" "cd,h" "SIZE" 1024 "BOMAP" "bt,r" "LEN" 5 "TYPE" "linear", "BINARY", no) )
# ecfp4 fingerprint
Descriptor( t.mol, Collection("ATMAP" "cd,h" "SIZE" 2048 "BOMAP" "bt" "LEN" 999 "TYPE" "ecfp", "ECFPITER" 3, "BINARY", no)  )

Example in which fingerprints returned by the function are directly used in distance calculation:


add column t Chemical({"CCN","CCCN"})
mod = Collection()
mod['FP'] = Descriptor( t.mol )
Distance(mod['FP'], Descriptor(Chemical("C(=O)CCN")))

Descriptor ( chemArray predModel )

- returns vector of rarrays with chemical descriptors calculated for each chemical. each rarray consists of chemical fingerprint part and values for columns with formula used in the predModel.

This information can be used for further analysis or exported outside ICM.

Example:


 # assumes that 'clogPpred' is a prediction model 
 tt = Table( Transpose( Matrix( Descriptor( Chemical("CCC"), clogPpred ) ) ))
 add column tt Name( clogPpred column )
 sort reverse tt.A

To find the description of the each particular position in the rarray Name function can be used.

Example:


rr = Descriptor( Chemical("CCC") myModel )[1]
na = Name( myModel column )
for i=1,Nof(rr)
  if (rr[i] != 0) print rr[i], na[i]
endfor

Disgeo

"DIStance GEOmetry"

 
 group sequence se1 se2 se2 se4 mySeqs 
 align mySeqs 
 distMatr=Distances(mySeqs)

Disgeo ( matrix )

[1:n

[i]

[i].

R_out returns four numbers: total negative eigen values, and the first 3 largest positive eigenvalues. All scaled to 100%.

 
 read sequences s_icmhome+"zincFing"   # read sequences from the file, 
 list sequences              # see them, then ...  
 group sequence alZnFing     # group them, then ...  
 align alZnFing              # align them, then ...  
 a=Distance(alZnFing)        # a matrix of pairwise distances  
 n=Nof(a)                    # number of points  
 b=Disgeo(a)                 # calculate principal components  
 corMat=b[1:n,1:n-1]         # coordinate matrix [n,n-1] of n points  
 eigenV=b[1:n,n]             # vector with n sorted eigenvalues  
 xplot= corMat[1:n,1] 
 yplot= corMat[1:n,2] 
 plot xplot yplot CIRCLE display # call plot a 2D distribution

Distance

Distance( II | RR | as as | seq seq ) → r_dist

Distance( S|s, s) → R|r

Distance( ali ali [exact] ) → r

Distance( S S [simple]) → M

Distance( Sn_hier_codes Sm_hier_codes tree [ s_delimeter ]) → M_nm_0to1

Distance( Mnk ) → Mnn_cart_dist_between_row-vectors

Distance( Mnk Mmk ) → Mnm

Distance( M_xyz|as M_xyz|as r_dist ) → l_yes_if_closer_than_dist

Distance( seq seq nucleotide [len] )

Distance( seqArr[n]> ) → <M_nn

Distance( ali seq [string] ) → R_n_seq_in_ali

Distance( seqArr[n]> <seq ) → R_n

Distance( seqArr[n]> <seqArr[m]> ) → <M_nm

Distance( as [r_default=-1.] ) → R_tether_lengths_or_def

Distance( as_n as_m ) → d_between_centers_of_mass

Distance( as_n as_m all ) → R_nm

Distance( as_n as_n rarray ) → R_n # aligned arrays, same n

Distance( ali [0] ) → M_interSeqDist

Distance( X_n [X_m] [pharmacophore|sstructure] ) → M_nxm_chemical_Tanimoto_distances

Distance( X_n X_m [[R_Wn R_Wm] r_maxdist (0.4) [r_steepness(6.)]] set ) → r_set1_2_distance

Distance( bitvecArr[n]> <bitvecArr[m]> ) → <M_nm #tanimoto, see Descriptor function

Distance( I_keys1 I_keys2 i_nBits|R_nbitWeights [simple] ) → M : Tanimoto|weighted

Distance( tree [i_at=1] split ) → r_splitLevel

Distance( tree all|modify ) → R_splitLevels|splitLevelTStats

Distance( g wire|grid [i_maxDist(1000000)>] ) → <M_shortestPaths

Distance( d_0to1|M evolution ) → d|M_Dayhoff_correction_applied

See detailed descriptions below.

Distance between iarrays

Distance ( iarray1, iarray2 )
- returns the real sqrt of sum of (I1_i -I2_i )² .

Distance between vectors

Distance ( R_X, R_Y ) - returns the real Cartesian distance between two vectors of the same length. D = Sum( ( X_i - Y_i )² )

Distance ~~as_

Distance ( as_1, as_2 [ all ] )

- returns the real distance in Angstroms between centers of mass of the two specified selections. The interactive usage of this function: Option all will return an array of all cross distances between the selections. The selected virtual atoms will be skipped if the selection level residue, molecule or object. Othewise, if you explicitly select virtual atoms, they will be included, e.g.


 build string "ala" # contains 2 virtual atoms at N terminus
 build string "his" # also contains 2 virtual atoms at N terminus
 Distance( a_1. a_2. all ) # no virtual atom distances
 Distance( a_1.// a_2.// all ) # selected virtual atoms are included

 Distance( a_1. a_2. ) # a single distance between centers of mass

Distance ~~as_ rarray

Distance ( as_1 , as_2, rarray )

- returns the rarray of distances in Angstroms between the two specified selections containing the same number of atoms (1-1, 2-2, 3-3, ...).

See also: Distance ( as1 as2 all )

Distance matrix

Distance ( M_coor ) - returns the square matrix of distances between the rows of the input matrix M_coor. Each row contains m coordinates (3 in 3D space). For example: Distance(Xyz(a_//ca)) returns a square matrix of Ca-Ca distances.

Distance between string arrays of hierarchical labels

Distance( Sn_hier_codes Sm_hier_codes tree [s_delimeter]) → M_nm

compares two arrays of hierarchical labels like this: "clan.family.subfamily.." The labels can be delimiter separated, or just strings of the same length where one positions is just one character (the default). Returns a distance matrix normalized to [0:1] range.

Here are examples of classification codes that can be used:

EC numbers, enzymes: eg Distance({"6.2.1.11"},{"6.2.1.25"}, tree, ".")
ATC drug codes, eg Distance({"C-03-C-C-01"},{"L-01-A-B-03"}, tree, "-") , use Replace to insert dashes between sections
AHFS drug codes, eg Distance({"08:18.32"},{"08:16.00","92:00.00"}, tree, ":.")
...

Examples:


add column t {"Cocaine","Betaxolol"} {"52:16.00","52:92.00"}
show Distance(t.B t.B tree ":.")
 
add column tt {"Cocaine","Betaxolol"} {"AB2","ABc"}
show Distance(tt.B tt.B tree ) # use each character as level

These distance matrices can be used for making 2D and 3D graphs, or to plot clustering trees of tables containint suitable label columns, eg see ds3D make tree object .

Tanimoto distance between two arrays of bit-strings

Distance( X_chem_n X_chem_m ) → M_nxm_distances

Distance( I_keys1 I_keys2 nBits | R_nBitWeights [simple] ) → M_distances
- returns the matrix of Tanimoto distances between two arrays of bit-strings. Each array of N-strings is represented by an iarray I_keys of N*( nBits/32 ) elements (e.g. if nBits is 32 , each integer represents 1 bit-string, if nBits i 64, I_keys1 has two integers for each bit string, etc.). The returned matrix dimensions are N1 x N2 . The distance is defined as 1. - similarity , where The Tanimoto similarity between bitstrings is defined as follows: The number of the on-bits in-common between two strings divided by the number of the on-bits in either bit-string.
You can provide a relative weight for each bit in a bit-string as a rarray R_weights. In this case the weighted Tanimoto distance is calculated as follows:

 
  distWeighted = 1. - Sum( Wi_of_common_On_Bits ) / Sum( Wi_of_On_Bits )

simple

second

 
Distance({3} {1} 32 simple ) # returns 0. 
Distance({1} {3} 32 simple ) # returns 0.5

 
Distance({1 2 3},{1 2 3},32) 
 #>M 
 0. 1. 0.5 
 1. 0. 0.5 
 0.5 0.5 0.

See also:

Distance(X, X, .., set) to calculate a single distance between two chemical sets
Score(X, X, .., set)
Iarray-bits-to-integers{ Iarray({1 0 0 1 1 0 ..} key ) } to generate compressed integer bit vectors

Similarity score between two sets of ligands

Distance( X_n X_m [[R_Wn R_Wm] r_minScore (0.3) [r_steepness(6.)]] set ) → r_distance [0:1]

retuns a real effective distance between two chemical sets. It is equal to 1 - r_similarity defined by function Score(X1,X2,..set) See also:

Score(X, X, .., set)

Distance matrix between two sets of coordinates

Distance ( M_coor1 M_coor2 )

Distance(Xyz(a_/1:5/ca) Xyz(a_/10:12/ca)

Distance( M_xyz1|as1 M_xyz2|as2 r_dist ) → l_yes_if_closer_than_dist This function returns a logical yes if any two points or atoms in two sets of coordinates or selections are closer than the threshold.
if Distance ( as1 as2 r_dist ) then ...

is a more efficient version of this condition:

if Nof( Sphere( as1 as2 r_dist )) > 0

Distance tether

Distance ( as [ r_defaultLength=-1.] )
- returns the real array of lengths of tethers for each selected atom or the default value ( -1. ). The default value can be set to any value. Tethers are assumed to be already set, see command set tether. Also note, that the expression Distance( as_out ) will give the same results if as_out selection was not changed by another operation; see also special selections.
Example:

read pdb "1crn" convert tether # keeps tethers to the pdb original deviations = Distance( a_//!h*,vt* , 9.9) perResDevs = Group( deviations, a_//!h*,vt* ,"max") # find max.devs per residue display ribbon color ribbon a_/* perResDevs # Another example Distance( a_//T ) # selects only tethered atoms #>R 1.677 1.493 1.386 1.435 1.645 1.570 2.165 1.399

Distance Dayhoff

Distance( seq1 seq2 [identity|evolution|new|fast|number|reverse] ) → r

Distance( r_dist_0_to_1|M_dist ) → r|M_Dayhoff_corrected_distances to [0.:10.]

Distance( seqArr[n] seq ) → R_n

Distance( d_0to1|M evolution ) → d|M_Dayhoff_correction_applied

- returns the real measure of difference between two aligned sequences. Zero distance means 100% identity. The distance is calculated by the following two steps:

d1 = 1.0 - (nResidueIdentities/Min(Length(Seq1), Length(Seq2)) (d1 belongs to [0.,1.] range)
if there is no identity option (or the sequence option for a general value to value transformation), the distance is corrected: Distance(Seq1,Seq2) = DayhoffTransformation( d1 )

Transformation practically does not change small distances d1, whereas large distances, especially above 0.9 (10% sequence identity) are increased to take occasional reversals into account. Distances d1 within [0.9,1.0] are transformed to [5.17, 10.] range.

The last function ( Distance( d|M evolution ) allows to apply Dayhoff correction that extends a distance from 0. to 1. to a range 0. to 10. to take into acount the evolutionary time correction (stretching) at larger distances because.

See also: Distance ( ali ) for distance and seq.identity matrices.

Distance between sequences or alignment sequences

Distance ( alignment ) → M_nxn

Distance( seqArr_n ) → M_nxn

Distance( seqArr_n seqArr_m ) → M_nxm

- returns matrix of pairwise sequence-sequence distances in the alignment. These distances are calculated with the fast option as follows

for each pair of sequences in multiple alignments calculate sequence identity according to the alignment as:
```
 
 1.-(nResidueIdentities)/Min(Length(Seq1), Length(Seq2)) 
```
the resulting number will be between 0. and 1.
apply the Dayhoff correction formula that will convert the linear distance to 'evolutionary' distance (kind of evolutionary time needed to arrive at this identity. Where the inifinite time is defined as 10. The resulting distance will be between 0. and 10. and area of 'insignificant linear distances' will be pushed to larger than 1. values. Between two identical sequences D= 0. , while the distance between two 30% different sequences will be around 0.5. The distance goes to an arbitrary number of 10. for completely unrelated sequences.

 
 read alignment msf s_icmhome+"azurins"            # read azurins.msf  
 NormCoord = Disgeo(Distance(azurins))   # 2D sequence diversity in  
#
# calculate pairwise sequence identities
 read alignment "aln" name="aln"
 n=Nof(aln)
 mids = 100*(Matrix(n,n,1.) - Distance(aln ))  # the pairwise seq. identities
 t = Table( mids, Name(aln), Name(aln) )  # to convert the matrix into pairwise table
 t = Table( mids, index )  # a simpler version with i,j

Distance between two alignments

Distance ( ali_1 ali_2 [ exact ] )

- returns the real distance between two alignments formed by the same sequences.
The distance is defined as a number of non-gap columns identical between two alignments.
Two different normalizations are available:
The default normalization is to the shorter alignment. ( Distance ( ali_1 ali_2 ) ). In this case the number of equivalent pairs is calculated and is divided by the total number of aligned pairs in the shorter alignment. This method detects alignment shifts but does not penalize un-alignment of previously aligned residue pairs. D = (La_min - N_commonPairs)/La_min In the following alignment the residue pairs which are aligned in both alignments are the same, therefore the distance is 0.

 
 show a1   # La1 = 3 
 ABC---XYZ 
 ABCDEF--- 
 show a2   # La2 = 6 
 ABCXYZ 
 ABCDEF 
 Distance(a1,a2)   # a1 is a sub-alignment of a2, distance is 0. 
 0.

exact option: normalization to the number of pairs of the longer alignment.

longer

D = (La_max - N_commonPairs)/La_max

 
 Distance(a1,a2,exact)  # returns 0.5 for the above a1 and a2

 
 read sequence msf s_icmhome+"azurins.msf"  
 gapOpen =2.2 
 a=Align(Azu2_Metj  Azup_Alcfa)   # the first alignment 
 gapOpen =1.9                     # smaller gap penalty and .. 
 b=Align(Azu2_Metj  Azup_Alcfa)   # the alignment changes 
 show 100*Distance(a b )          # 20% difference 
 show 100*Distance(a b exact )    # 21.7% difference 
 show a b

The distance of the cluster splitting level

Distance( treeArr i_at separator )

- return the current value of the cluster splitting level set by split command.

Chemical similarity distance

Distance( chemarray [pharmacophore] )

- return square matrix of chemical distances. The chemical distance is defined as the Tanimoto distance between binary fingerprints Option pharmacophore uses different fingerprints based on ph4-type triplets.

Example:


Distance( Chemical( { "CCC", "CCO"} ) )

Distance( chemarray1 chemarray2 [pharmacophore] )

- return a MxN matrix where M is number of elements in chemarray1 and N is number of elements in chemarray2 Option pharmacophore uses different fingerprints based on ph4-type triplets.

Example:


Distance( Chemical({ "CCC", "CCO"}) Chemical("CC" ))

Zero distance for non-identical compounds.Sometimes non-identical compounds can give a zero fingerprint distance due to the limitations inherent in finite length fingerprints. To make the distance more representative, one can mix different types of distances, e.g. for two chemical arrays X1 and X2


Mdist = Distance( X1, X2 ) + 0.1*Distance(X1,X2, pharmacophore)

Eigen

eigenvalues/eigenvectors function, eigendecomposition of a square diagonal matrix.
Eigen ( M ) → X_eigenVectorColums and R_out with eigen values
- returns the square matrix ( n x n ) of eigenvector columns of the input symmetric square matrix M_ . All n eigenvalues sorted by their values are stored in the R_out rarray.

Eigen value decomposition is be given by three matrices: X, Matrix(3,R_out) and Power(X,-1)
Example of an eigen-value decomposition:

 
# create a symmetric real matrix which describes a transformation
read matrix name="A" input="""
2. 0.6 0.5 
0.6 4. 0.3 
0.5 0.3 6. 
"""
 X = Eigen(A)              # calculate eigenvectors... 
 V = R_out                 # and save eigenvalues in rarray V 
 L = Matrix(3,V)           # diagonal matrix with eigen values
# note that now A can be reproduced by this calculation : X*L*X^^-1
 show A,  X*L*Power(X,-1)
# Eigenvectors are X[?,1],  X[?,2], X[?,3]
 show X[?,1]  # 1st eigen-vector

Energy

Energy ( string )

real

pre-calculated

terms

Important

calculated

show energy

minimize

ssearch

montecarlo

Note

Allowed terms in the string are "vw,14,hb,el,to,af,bb,bs,cn,tz,rs,xr,sf";
"func" stands for the total of all the terms, both energy and penalty;
"ener" is only the energy part (i.e. "vw,14,hb,3l,to,af,bb,bs,sf" );
"pnlt" is only the penalty part (i.e. "cn,tz,rs,xr" ).
load conf and load frame commands fill out all the energy/penalty terms, which are stored in both stacks and trajectory files (of course the values also depend on a set of free variables). You can get the energy/penalty terms of the loaded conformation without explicitly recalculating them using the Energy function, e.g. Energy("func")

Energy ( rs [ simple | base | s_energyTerms ] )

residue

convert

energy terms

fixation

Note:

unfix only V_//S,V

simple

base

simple

base

not

simple

base

"vw

vwMethod

vwSoftMaxEnergy

simple


build string "ASDF"
unfix only V_//S,V
add column t Name(a_/A full) Energy( a_/A simple )  Energy( a_/A base )
show t

electrostatic ( "el" ) term and electroMethod = "boundary element", "MIMEL", or "generalized Born"

s_energyTerms

 
  read pdb "1crn" 
  delete a_W 
  convert 
  set terms "vw,14,hb,el,to,en,sf" 
  group table t Energy( a_/A ) "energy" Label(a_/A ) "res" 
  show t 
  unfix V_//* 
  group table tBondsAngles Energy( a_/A "bs,bb" ) "covalent" Label(a_/A ) "res" 
  show tBondsAngles

calcEnergyStrain

Energy ( conf i_confNumber)

table

sarray of the energy term names ( .hd ) and
rarray of energy values for each energy term ( .ey ) and

Energy ({ stack | conf } )

 
 read object s_icmhome+"crn.ob"
 set terms only "vw,14,hb,el,to"  # set energy terms  
 show energy v_//xi*              # calculate energy with only  
                                  # side chain torsions unfixed  
           # energy depends on what variables are fixed since  
           # interactions inside rigid bodies are not calculated,  
           # and rigid body structure depends on variables  
 
 a = Energy("vw,14")      # a is equal to the sum of two terms  
 
 electroMethod="MIMEL"    # MIMEL electrostatics  
 set terms only "el,sf"   # set energy terms  
 show energy 
 print Energy("ener")     # total energy  
 print Energy("sf")       # only the surface part of the solvation energy  
 print Energy("el")       # electrostatic energy  
 print r_out              # electrostatic part of the solvation energy

Entropy

Entropy( R_frequencies ) → r_entropy

Entropy( R_energies r_RT_energy ) → r_entropy

Entropy( seq [simple|R_26aa_prob] ) → r_entropy

returns energy calculated as ∑_i p_i Log(p_i) where p_i probabilities are calculated either as normalized R_frequencies or exp( -(E-E_min)/ r_kT_energy ) factors.

If the frequency array contains only one element it is considered as the first probability of an array of two probability that should add up to 1., ie {0.2} is interpreted as {0.2, 0.8}.

The sequence entropy is calculated according to the residue probability from a standard amino acid frequency table. You may substitute it with your own array of 26 numbers. Option simple assumes residue frequencies to be 1/20.

Notes:

note that the entropy is unit-less generally proportional to the number of comparable high frequency/low_energy states.
Natural logarith (base e ) is used for the calculation
for Shannon entropy one needs to multiply the answer by 1./Log(2.) ≃ 1.4427
if one uses molar energies in kcal/mol , the temperature factor will be RT ≃ 0.6 kcal/mol

Examples:


Entropy({0.5, 0.5})      # 0.693147, two equi-prob. states
Entropy({0.2})           # 0.500402, two states: p=0.2 and p=0.8
Entropy({0.2, 0.2, 0.2})       # 1.09861  three states, 
Entropy({10.2, 0.2, 0.2, 0.2}) # 0.275593 one estates dominates
# below the numbers are interpreted as energies, and 1.4 is a temperature factor.
Entropy({-30., -28., -31., -15.}, 1.4) # 0.848429. ΔEs divided by 1.4, exponentials used to calc probs. 
Entropy({-30., -28., -31., -15.},100.) # 1.38432 at this high temperature 4 states are almost equi-probable.

Error

Error

yes

no.

yes

set error

Error ( string )
- returns string with the last error message. It also returns integer code of the last error in your script in i_out . In contrast to the logical Error() function, here the internal error code is not reinstalled to 0, so that you can use it in expressions like if( Error ) print Error(string) .
Error ( i_error_or_warning_code ) → l Error ( number ) → s - returns logical yes if an error or warning with the specified code occurred previously in the script. This call also resets the flag (e.g. Error(415) ). This is convenient to track down certain warnings or errors in scripts (e.g. detecting if 'readpdb{read pdb} found certain problems).
Option number will return a string will previously set error and warning messages.
To clear all bits use the clear error command.

Examples:

 
 read pdb "1mng"  # this file contains strange 28-th residue  
 if (Error) print "These alternative positions will kill me" 
 
 read pdb "1abcd"  # file does not exist 
 read pdb "1mok" 
 clear error

errorAction

s_skipMessages

l_warn

Warning

Error ( r_x [ reverse ] )

real

erfc(x)=1.-erf(x))

(2/sqrt(pi)) integral{x to infinity} of exp(-t²) dt

reverse

r_x

-r_x.

 
 show 1.-Error(Sqrt(0.5)) # P of being inside +-sigma (about 68%) 
 show Error(2.*Sqrt(0.5)) # P of being outside +- 2 sigma

Error ( R_x )

erfc(x)=1.-erf(x))

 
 x=Rarray(1000 0. 5. ) 
 plot display x Error(x ) {0. 5. 1. 1. 0. 1. 0.1 0.2 } 
 plot display x Log(Error(x ),10.) {0. 5. 1. 1.}   
       #NB: can be approximated by a parabola 
       #to deduce the appr. inverse function. 
       #Used for the Seq.ID probabilities.

Error (for SOAP messages)

Error( soapMessage )

- returns a error string from the SOAP message. (empty string if no error)

This function is used the check the result of calling SOAP method.

See: SOAP services for more details and examples.

Exist

[ Exist-pattern | Exist molcart ]

Exist ( s_fileName [ write | read | directory ] )

yes

no

write open for writing
read open for reading
directory the provided string is a directory (not file)

Exist( collection s_fieldname )

Checks if the field exists, e.g.


c = Collection(); c['a']=123
Exist(c,'a')  # yes
Exist(c,'b')  # no

Exist ( key, s_keyName ) - returns logical yes if the specified keystroke has been previously defined. Examples: Exist(key, "F1" , Exist( key, "Ctrl-B" ) See also: set key command.
Exist ( object ) - returns logical yes if there is at least one molecular object in the shell, no otherwise.
Exist ( os1 stack ) - returns logical yes if there is a built-in object stack , no otherwise.
Exist ( box ) - returns logical yes if the purple box is displayed, no otherwise.
Exist ( view ) - returns logical yes if the GL - graphics window is activated, no otherwise.
Exist ( gui ) - returns logical yes if the GRAPHICS USER INTERFACE menus is activated, no otherwise.

Exist ( grob display ) - returns logical yes if the grob is displayed.
Exist( connect ) - returns logical yes if the mouse rotations are connected to a graphical object ( grob ) or a molecular object.

Exist( s_table_name sql table ) - returns logical yes if there is an sql table with the specified name exists. It works with the Molcart tables or tables accessed via the Sql function.

Exist( variable s_varName ) - returns yes if the variable exists in the ICM shell, no otherwise. See also Type( ). E.g.


 Exist(variable, "aaa")  # returns no
 aaa=234
 Exist(variable, "aaa")  # returns yes

 
 if (!Exist("/data/pdb/") then 
   unix mkdir /data/pdb 
 endif 
 
 if(!Exist(key,"Ctrl-B")) set key "Ctrl-B" "l_easyRotate=!l_easyRotate" 
 
 if !Exist(gui)  gui simple

Exist( chemarray pattern )

returns logical yes if at least one of the elements contains SMARTS search attributes, no - otherwise.

Example:


Exist( Chemical("[C&H1,N]") pattern )  # returns yes
Exist( Chemical("CCO") pattern ) # return no

Database information

Exist( s_dbtable sql table )

- returns logical yes if the specified table exists in the database

Extension

Extension ( string [ dot ] )

dot

Extension ( sarray [ dot ] )

dot

 
 print Extension("aaa.bbb.dd.eee")   # returns ".eee"  
 show Extension({"aa.bb","122.22"} dot)         # returns {"bb","22"} 
 read sarray "filelist"                                     
 if (Extension(filelist[4])==".pdb") read pdb filelist[4]

Exp

Exp ( real )

real

Exp ( rarray )

rarray

Exp ( matrix )

matrix

 
 print Exp(deltaE/(Boltzmann*temperature))   # probability  
 print Exp({1. 2.})                          # returns { E, E squared }

Field

[ Field user ]

Field ( s [ s_precedingString] i_fieldNumber [ s_fieldDelimiter] )

s_fieldDelimiter

The s_fieldDelimiter string

Single

 
 Field("a b c",3," ")  # space 
 Field("a:b:c",3,":")  # colon

Alternative

 
 Field("a%b:c",3,"%:")  # percent OR colon

Multiple

repeating

two

 
 Field("a  b   c",3,"  ")    # two==multiple spaces in field delim 
 Field("a%b::::c",3,"%::")   # a single percent or multiple colons

s_fieldDelimiter

 
 s=Field("1 ener glu 1.5.",3)    # returns "glu"  
 show Field("aaa:bbb",2,":")     # returns "bbb"  
 show Field("aaa 12\nbbb 13","bbb",1) # returns "13"  
 show Field("aaa 12\nbbb 13 14","bbb",2,"  \n\n") # two spaces and two \n .  
# another example 
 read object s_icmhome+"all" 
       # energies from the object comments, the 1st field after 'vacuum' 
 show Rarray(Field(Namex(a_*.),"vacuum",1))

Field ( S , [ s_precedingString] i_fieldNumber [ s_fieldDelimiter] )

string array

s_fieldDelimiter

 
 show Field({"a:b","d:e"},2,":") # returns {"b","e"}  
 s=Field({"aa 2 3.3", "bb 4 1.3", "cc 31a 1.1 3"},2) 
         # returns {"2","4","31a"}  
 s=Field({"aa 2 3.3", "bb 4 1.3", "cc 31a 1.1 3"},4)   
         # returns {"","","3"}

Split

User field from a selection

Field( as|rs|ms|os [s_fieldName] )

Field( { rs | ms | os } [ i_fieldNumber ] )

Field( os 15 )

returns rarray of user-defined field values of a selection. Some fields are filled upon reading a pdb file
Atoms. Only one user defined field can be set to atoms, e.g.

 
read object s_icmhome+"crn.ob"
set field a_//* Random(0.,1.,Nof(a_//*)) 
show Field( a_//* ) 

read pdb "1f88" # rhodopsin, many loops missing
Field( a_ 15) # returns 31. residues
Field( a_ "pmid") # iarray[1] with pubmed id, automatically created by read pdb
set field a_/10,14,21 name="pocket"
display cpk Field ( a_/* "pocket" )

Residues, molecules and objects.

i_fieldNumber

Res

Mol

Obj

Res(Sphere(gg, a_1. 3.))

grob

gg

Upon reading a pdb file the object field 15 contains the number of residues missing from the ATOM records, but present in SEQRES records due to local disorder. Example:

 
read object s_icmhome+"crn.ob"
set field a_/A Random(0.,1.,Nof(a_/A)) number = 2 # set the 2nd field to random values 
GRAPHICS.atomRainbow= "yellow/green/blue/blue"    # optional redefenition of colors
color a_/* Field( a_/A  2 )                       # color by it

Standard fields:

object: "pmid" - integer pubmed id

set field as_ [ name= s ] .. ,
Smooth rs_ to generate 3D-averaged user fields
Select function to select by user defined field (e.g. Select( a_// "x>-1." ) ).

File

File ( os )

 
read pdb "/home/nerd/secret/hiv.ob" 
File( a_ ) 
 /home/nerd/secret/hiv.ob

File ( s_file_or_dir_Name "length" )

File ( s_file_or_dir_Name "time" )

File ( icm_object )

File ( s_file_or_dir_Name )

Note that this function will only work on Unix or Mac, see a`Exist ( s_file .. ) function for cross-platform functions. If file or directory do not exist the function returns "- - - - 0" Otherwise, it contains the following 4 characters separated by space and the file size:

type character:
- 'f' - regular file
- 'd' - directory
- 'l' - symbolic link
- 'c' - character special file
- 'p' - pipe
'r' if you can read the file (or from the directory)
'w' if you can write to this file (or directory)
'x' if you can execute this file (or cd to this directory)
file size in bytes

s_name)

i_fieldNumber)

s_name)

 
 if File("/opt/icm/icm.rst")=="- - - - 0" print "No such file"   
 
 if Field(File("PDB.tab"),2)!= "w"  print "can not write"   
 
 if ( Indexx( File("/home/bob/icm/") , "d ? w x *" ) ) then 
    print "It is indeed a directory to which I can write" 
 endif 
             # Here the Indexx function matched the pattern.  
 
 if ( Integer(Field(File(s_name),5)) < 10 ) return error "File is too small"

File ( last )

called

File(last)

Path

last

File ( T_IndexTable database )

returns the file name of the first source file indexed. Example:


  read index "nci"
	File( nci database)
 /data/chem/nci.sdf

Find

[ Find in array | Find in table | Find chemical ]

Find closest value in array

Find ( R_source r_value )

Find ( I_source i_value )

- returns index of the source array element which is closest to the value

Example:


Find( {10 20 30 40 50} 43 ) #will return 4 because 40 is the closest value
Find( {1. 2. 3.} 100. ) #will return 3

Floor

Floor ( r_real [ r_base ] )

real

r_base

r_real.

Floor ( R_real [ r_base] )

r_base

R_real.

r_base=

1.0

Hint: for rounding numbers you can use Floor( value + 0.5 )

Example:


rr = { 0.1, 0.5, 1.2, 1.7 }
Floor( rr + 0.5 )

Ceil

Formula

Formula( chemarray )

- returns the sarray of compounds' molecular formulas.

Getarg

function returning the value for an argument to ICM or an icm-script. If one runs icm directly, specify arguments after the -a option,

e.g.


icm -s  -a t=2 verbose c='some text' # three arguments passed to icm
icm_script t=2 verbose c='some text' # three arguments passed to icm_script

A summary of the Getarg functions:

: Getarg( )->
: Getarg( name )-> S_argNames
: Getarg( name [delete] )-> S_files,e.g. '.icb'
: Getarg( set|list|mol|keep|sarray )-> S_argValues # mol or keep adds stdin and keep for chunk access
: Getarg( [find|test] ) ->
: Getarg( [name] )-> # e.g. -verbose
: Getarg( [] [delete] )->
: Getarg( gui )-> # after Askg
: Getarg( gui )->


if Getarg(help) quit HELP
mid   = Getarg( "-mid",no,delete) # logical files = Getarg(list,delete)  # all args without '-'
files = Getarg(input,delete)  # file names (undashed args), appended with 'keep' and stdin if necessary
outfiles = Getarg(output,delete) # file names
files = Getarg(mol,delete)   # same for .sdf* files

c=Collection()
c["a" ]        = Getarg("-a",test)          # logical to activate the option
c["a_params" ] = Getarg("-a","10:30",delete) # defaults and params
c["m"]=Getarg("-m",test); c["mfrto"] = Getarg("-m","100:500",delete)

Getarg ( s_icmargName [s_default] [ delete ] )

Getarg ( s_int_argName [i_default] [ delete ] )

Getarg ( s_real_argName [r_default] [ delete ] )

Getarg ( s_log_argName [l_default] [ delete ] )

If the default value is provided, the returned object is cast to default value's type. Else the function tries to guess the return type based on the value format. If the default value is of logical type, the function returns the opposite value if the argument is found in the list. e.g. Getarg("-x",yes) will return no if the option was specified).

for icm or icm-script arguments like name returns a string with "yes". For argument name=value returns the argument value converted according to the default value. The default value is be returned if the argument is not specified. Option delete extracts the variable from the list.

Getarg( )

returns a concatenated list (`string) of all arguments prepared for interpretation by a Unix shell. This is convenient for passing arguments further to a nested script. Trim(Getarg(),all) will return the empty string if no arguments are found.

Getarg( list|keep [delete] )

returns sarray of non-option arguments (usually they are file names). Option keep adds the "stdin" for dash or no arguments, and adds keyword keepto keep the file open for multiple 'chunk' access to it.

Getarg( name )

returns sarray of argument names

Getarg( set )

returns sarray of argument values

Getarg( delete )

deletes all arguments and returns the number of them

Testing if the argument exists Getarg( s_argName [find|test] )

returns yes if the argument can be found in the list in any form.

Getarg( s_argName [name] )

returns yes if the argument is in the list as the name only (rather than the name=value pair). E.g. -verbose will return yes, and -verbose=2.3 will return no.

Getarg ( i_pos gui )

returns string which contain a user input after GUI dialog execution using Askg function.

Examples :


 if Getarg("-L" find) print "-L was found"
 t = Getarg("time","1.",delete)
 s = Getarg("sequence","ABC")
 Getarg("-L" yes ) # returns no in this case, yes is the default
 Getarg("-L" no ) # returns yes since no was the default
 args = Getarg(name)
 wrongArgs = NotInList({"s","t"} ,args) 
 if wrongArgs print " error> illegal arguments ", Sum(wrongArgs)


> icm -a time=1.5 sequence="ADEGFKL" -L file1 file2

An example with an icm script:


> cat script.icm
 #!icm -s
 x=Getarg("x","3")
 y=Getarg("y","a b c")
 show x,y

> script.icm x=33 y="d e"
 33, "d e"

An example of dialog input:


buf  = "#dialog{\"Select InSilco Models\"}\n"
buf += "#1 s_Some_Input (some text)\n"
buf += "#2 l_Check (no)\n"
buf += "#3 i_Number (4)\n"
Askg( buf )  # run the dialog
print Getarg( 1 gui ) Getarg( 2 gui ) Getarg( 3 gui )

Another example with a text box txw_ spec :


#dialog{ "Sample Dialog" }
# txw_Enter_Text ()
txt = %s_out   # s_out is not a safe place (might be overwritten)
print Length(txt)

See also Putarg , Getenv, script .

Getenv

function returning value for an environment name.
Getenv ( s_environmentName [s_default] )
- returns a string of the value of the named environment variable. If the default string is provided, it is used if the variable is not found.
Example:

 
 user = Getenv("USER")      # extract user's name from the environment  
 if (user=="vogt") print "Hi, Gerhard" 

 Getenv("HOME","you are homeless :-(") # use default if HOME is not found
 /home/ruben/
 Getenv("HOME_MISSPELLED","you are homeless :-(")
 you are homeless :-(
 Getenv("HOME_MISSPELLED") # error

Existenv

Putenv

Gradient

Gradient( )

real

Gradient ( vs_var )

Gradient ( as | rs )

GRAPHICS.displayLineLabels

(G[i] = Sqrt(Gxi*Gxi+Gyi*Gyi+Gzi*Gzi))

as_

rs_

(-Gxi, -Gyi, -Gzi)

Important:

Example:

 
 read object s_icmhome+"crn.ob"
 show energy                # to calculate the gradient and its components  
 if (Gradient( ) > 10.) minimize 
 show Max(Gradient(a_//c*)  # show maximum "force" applied to the carbon atoms

Grob

[ Grob-select-by-color ]

Grob( M_NxM )

- returns grob for 3D surface function: X=i,Y=j,Z=~~M_NxM
Grob( M_Nx3_xyz dot )

- returns grob of dots with xyz from M_Nx3_xyz

Grob( M_Nx3_xyz r_ra ball )

- returns grob of spheres with centers from M_Nx3_xyz

- returns grob of dots with xyz from M_Nx3_xyz and radius r_ra

Grob ( "arrow", { R_3 | R_6 } )
- returns grob containing 3D wire arrow between either 0.,0.,0. and R_3, or between R_6[1:3] and R_6[4:6].
Grob ( "ARROW", { R_3 | R_6 } )
- returns grob containing 3D solid arrow. You may specify the number of faces by adding integer to the string: e.g. "ARROW15" (rugged arrow) or "ARROW200" (smooth arrow).
See also: GROB.relArrowSize.
Examples:

 
 GROB.relArrowSize = 0.1 
 g_arr = Grob("arrow",Box( ))  # return arrow between corners of displayed box 
 display g_arr red            # display the arrow  
 
 g_arr1 = Grob("ARROW100",{1. 1. 1.}) 
 display g_arr1

Grob ( "cell", { R_3 | R_6 } )

grob

R_3

90.,90.,90.

Grob ( "CELL", { R_3 | R_6 } )

grob

R_3

90.,90.,90.

 
 read csd "qfuran" 
 gcell = Grob("CELL",Cell( ) )      # solid cell  
 display a_//* gcell transparent   # fancy stuff

Grob ( "distance", as_1 [ as_2 ] )

grob

 
  build string "se ala his trp" 
  g = Grob( "distance", a_/1/ca a_/2/ca ) 
  display g 
  GRAPHICS.displayLineLabels = no 
  display new

Grob ( "label", R_3, s_string )

grob

R_3

Grob ( "line", R_3N )

grob

R_3N[1:3]

 
 display a_crn.//ca,c,n 
 g = Grob("line",{0.,0.,0.,5.,5.,5.})   # a simple line (just as an example) 
 display g yellow 
 gCa = Grob("line",Rarray(Xyz(a_//ca))) # connect Cas with lines  
 display gCa pink                       # display the grobs

Grob ( "SPHERE", r_radius i_tesselationNum )

grob

i_tesselationNum

 
 display a_crn.//ca,c,n 
               # make grob and translate to a_/5/ca  
               # Sum converts Matrix 1x3 into a vector 
 g=Grob("SPHERE",5.,2)+Sum(Xyz(a_/5/ca))   
               # mark it with dblLeftClick and 
               # play with Alt-X, Alt-Q and Alt-W 
 display g red

Grob ( "TORUS", r_radius r_radius2 [R_normalVector] [i_quality] ) - returns grob containing a solid torus.

Grob ( "ELLIPSOID", r_radius r_radius2 [R_normalVector] [i_quality] ) - returns grob containing a solid ellipsoid.

Grob ( "CYLINDER", r_radius r_height [R_normalVector] [i_quality] ) - returns grob containing a solid cylinder.

Example


  t = Grob("TORUS", 1.2 0.2 )
  e = Grob("ELLIPSOID" 1 0.4 )
  display smooth t red
  display smooth e blue

Example (display plane of the phenyl ring)


build smiles "(CC(C)Cc1ccc(cc1)C(C)C([O-])=O)"
display xstick a_
find chemical a_ "c1ccccc1"  # result is stored into as_out
n = Normalize(Vector( Rarray(Xyz(as_out[2])-Xyz(as_out[1])) Rarray(Xyz(as_out[3])-Xyz(as_out[1])) ),"euclidean" ) # normal
gr_plane = Grob( "CYLINDER", 2. 0.05, n ) + Mean( Xyz( as_out  ))
display smooth transparent gr_plane

Grob( grob R_6rgbLimits )

grob

from_R, to_R, from_G, to_G, from_B, to_B

{0.9,1.,-0.1,0.1,-0.1,0.1}

Sphere

as_

 
build string IcmSequence("ADERD") # a peptide
dsRebel a_  no no
g=Grob(g_electro_def_ {0.9,1.,-0.1,0.1,-0.1,0.1} )  # red color
display g_electro_def_ transparent
display g
show Res(Sphere( g, a_//* 1.5))

color grob

GROB.atomSphereRadius

Group function

Group ( R_n_atoms as_n_atoms "min"|"max"|"avg"|"rms"|"sum"|"first" ) → R_resArray

Group ( I_n_atoms as_n_atoms "min"|"max"|"avg"|"sum"|"first" ) → I_resArray

Group ( as_atomSelection "count" ) → I_resArrayOfNat
returns an array of atoms properties aggregated to a per-residue array. One of the following functions can be applied to the atomic values:

"min" - stores the minimal atomic property for each selected residue
"max" - stores the maximal atomic property for each selected residue
"avg" - (syn. "mean") stores the mean of properties for each selected residue
"rms" - stores the root-mean-square deviation of properties for each selected residue
"sum" - stores the sum of properties for each selected residue
"first" - stores the property of the first atom in selected residue
"count" - stores the number of selected atoms in selected residue

 
read pdb "1crn" 
show Group( a_A//* "count"  ) # numbers of atoms in residues 
show Group( Mass( a_A//* ) , a_A//* "sum"  ) # residue masses 
show Group( Mass( a_A//* ) , a_A//* "rms"  ) # residue mass rmsd

Header

Header ( os )

returns sarray with the PDB entry information stored in the requested objects. PDB entry information is stored in objects in HTML format. Use Header( os1_ )[1] for a single string.

In order to be able to access the additional information in the objects' header, they should be read from PDB using the read pdb command with the header option.

Notice that if the object was read with the read pdb html option the header will be in html format, while it if the header option was used instead, the entire header will be stored as is.

Example:


read pdb "1crn" html
h1 = Header( a_1crn. )[1]
set property h1 html

Iarray

Iarray( [i_n=0 [i_default=0]] )

Iarray( R|S|I ) → I

Iarray( I reverse ) → I_reverseOrder

Iarray( I key ) → I_compress01intoInts # obsolete

Iarray( stack ) → I_nofVisits

Iarray( as ) → I_atomCodes

Iarray( as topology ) → I_atomSymmetryNumbers

Iarray( rs|ms|os ) → I_nAtomsInEachRes|Mol|Obj

Creating or converting into an iarray

Iarray ( i_NumberOfElements [ i_value ] )
- returns iarray of i_NumberOfElements elements set to i_value or zero. You can also create an zero-size integer array: Iarray(0) .
Iarray ( rarray )
- returns iarray of integers nearest to real array elements in the direction of the prevailing rounding mode magnitude of the real argument.
Iarray ( sarray ) - converts sarray into an iarray.
Examples:

 
 a=Iarray(5)                        # returns {0 0 0 0 0}  
 a=Iarray(5,3)                      # returns {3 3 3 3 3}  
 b=Iarray({2.1, -4.3, 3.6})         # returns {2, -4, 4}  
 c=Iarray({"2", "-4.3", "3.6"})     # returns {2, -5, 3}

Reversing the order of elements in an integer array.

Iarray ( iarray reverse )

 
Iarray({1 2 3} reverse)  # returns {3 2 1}

Sarray

reverse

reverse

String

Generating a compressed integer bit vector for Tanimoto calculations.

Iarray( I_nBitVector key ) - returns a shorter vector of integers if n/32 elements, in which every 32 array values of zeros and non-zeroes are compressed into one integer. The number of elements n does not need to be a multiple of 32, the missing elements will be assumed to be zero. Example:


Iarray({1 0 1 0 0 0 0},key) # returns {5}
Iarray({1 1 1 0 0 0 0},key) # returns {7}

See also:

Distance ( I_n I_n nBits key ) → M_nxm_Tanimotos

Iarray( ~~as_ ): relative atom numbers of a selection

- returns iarray of relative atom numbers in a single object. This iarray can be saved and later reapplied with the Select ( os_ I ) function. If you selection covers more than one object, the function returns an error.
Example:

 
build string "se ala" 
ii = Iarray( a_//c* ) # returns {6,8,12} 
Select( a_  ii )      # returns three carbons

Iarray( [as_|rs_] number ): residue numbers of a selection

- returns iarray of residue numbers for an input selection.

Example:


build string "ala glu"
Iarray( a_/ number )  # residue level
Iarray( a_// number ) # atom level

Iarray( stack ): numbers of visits for all stack conformations

Iarray( stack )

stack

nvis>

show stack

 
 show stack 
 iconf>       1       2       3       4       5 
 ener>    -15.3   -15.1   -14.9   -14.8   -13.3 
 rmsd>     84.5    75.3     6.4    37.2   120.8 
 naft>        3       0       4       0       2 
 nvis>       10       9       8       1       4 
Integer(stack)  # returns { 10 9 8 1 4 }

IcmSequence

icm.se

IcmSequence ( { sequence | string | rs }, [ s_N-Term, s_C-Term ] )

icm.se

a sequence, e.g. IcmSequence(1crn_m)
a string, e.g. "ASDGFRE", or "SfGDA;WER" .
or residue selection, rs_ , (e.g. a_2,3/* ).

standard L amino-acids: upper case one-letter code (B,J,X,Z are illegal), e.g. ACD
D-amino acids: lower case for a corresponding amino acid (e.g. AaA for ala Dala ala )
new molecule: use semicolon or dot as a chain separator ( ; ) ( e.g. AAA;WWW )

prou

pro

oxt

 
IcmSequence( a_/* ) # C-terminal residue "cooh" will be added if oxt is found 
IcmSequence( a_/* "","" )	 # no terminal groups will be added 
IcmSequence( a_/* "","@coo-" )    # "coo-" will be added only if oxt is found 
IcmSequence( a_/* "nh3+","coo-" ) # "nh3+" and "coo-" will always be added

ICM mol-sequence file

icm.res

 
 write IcmSequence(seq1) "seq1.se"      # create a sequence  
                                        #  file for build command  
 
 show IcmSequence("FAaSVMRES","nh3+","coo-")  # one peptide with Dala 
 
 show IcmSequence("FAAS.VMRES","nter","cooh") # two peptides  
 show IcmSequence("AA;MRES","nter","cooh") # two peptides  
 
 read pdb "2ins" 
 write IcmSequence(a_b,c/* ,"nter","@cooh") "b.se" # .se file for b  
                                                   # and c chains

oxt

build string

 
 build string "SDSRAARESW;KPLKPHYATV"  # two 10-res. peptides

icm.se

Image

Image( slides )

- returns the image array containing slide thumbnails. E.g.

 
group table t Image( slideshow.slides )

Image( grob texture )

- returns the image array with textures stored in the grob

Index numbers of selected table rows

Index ( T_tableExpression_orSelection ) → I_matchingRows

Index ( T_table_with_graphical_selection_or_rows selection ) → I_matchingRows
- returns an integer array of order numbers (indices) of rows selected by the table expression. Example in which we find which value of column B corresponds to a value in column A:

 
   group table t {33 22 11} "A" {"a","b","c"} "B" 
   Index(t.A==22)  # returns 2 for 2nd row 
   #>I 
   2 
   t.B[ Index(t.A== 22 )[1] ]   # returns B according to A value 
   b

Index numbers of labeled table rows

Index ( T_table i_label label ) → I_matchingRows

- returns an integer array of order numbers (indices) of rows with labels equal to

See also: set label table Label

Indexes of unique elements in an array

Index ( S_data unique ) → I_indexes

Index ( I_data unique ) → I_indexes

- returns iarray containing indexes of unique elements in the data array, sorted in ascending order.

Examples:


test Index( {1 7 5 7 2 1 1 5} unique )=={1 2 3 5}
test Index( {"a" "A" "a" "B" "A"} unique )=={1 2 4}

Index numbers of selected table rows

Index ( S_data, s_value [reverse] ) → i_FirstOrLastMatchingElement

Index ( I_data, i_value [reverse] ) → i_FirstOrLastMatchingElement

Index ( rarray, real [reverse] ) → i_FirstOrLastMatchingElement
- returns integer value indicating the first (or last with reverse option) array element number exactly matching the value string or real, or 0 otherwise. To return an array of matches in an array, use the all option (see below).
Examples:

 
 show Index({"Red Dog","Amstal","Jever"}, "Jever")      # returns 3  
 show Index({"Red Dog","Amstal","Jever"}, "Bitburger")  # returns 0  
 show Index({3 ,2, 8},2 )  # returns 2  
 show Index( 0.3//0.1//0.2//ND//0.5, ND ) # returns 4
 show Index( 0.3//0.1//0.2//ND//0.5, 0.1 ) # returns 2

Index ( S_data, s_value all ) → I_matchPositions

Index ( I_data, i_value all ) → I_matchPositions

Index ( R_data, r_value all ) → I_matchPositions
- returns iarray listing all positions where the value was encountered.

Index ( I_indexes, i_nofElements inverse ) → I_complementarySetOfIndexes

- this function returns a complement of the input set of indexes. It is similar to a negation of a selection.

Examples:

 
 show Index({"A","B","C","B","B"}, "B")      # returns {2,4,5}  
 show Index({1,2,6,4},3)  # returns empty iarray  
 show Index({1,3} 5 inverse) # returns {2,4,5}

Index ( alignment, sequence )

returns integer index of an identical sequence in the alignment of 0.

Index ( alignment selection column ) or Index ( alignment rs )

returns an iarray of column positions selected graphically in the alignment. See also: macro calcSelSimilarity
Index ( object )
- returns integer value of sequential number of the current object in the molecular object list, or 0 if no objects loaded. (Note that here object is used as a keyword.)
Examples:

 
 l_commands = no 
 read pdb "1crn" 
 read object s_icmhome+"crn" 
 printf "The object a_crn. is the %d-nd, while ...\n", Index(object) 
 set object a_1. 
 printf "the object a_1crn. is the %d-st.\n", Index(object)

Cluster selection and centers

Index ( tree center [r_threshold] ) - returns cluster centers (current threshold is taken if not specified)

Index ( tree selection ) - returns indices of table rows which are selected in cluster

Compare arrays and return sets with overlapping and unique indexes

Index ( {iarray|rarray|sarray}, {iarray|rarray|sarray} compare )

- returns a collection object with four fields:

"A" - indexes of the elements from the first array which do not present in the second
"B" - indexes of the elements from the second array which do not present in the first
"AB" - indexes of the elements from the first array which do present in the second (overlap)
"BA" - indexes of the elements from the second array which do present in the first (overlap)

Example:


 a = Random(1,100,50 )
 b = Random(1,100,60 )
 c = Index( a, b, compare  )
 show a[ c["A"] ]    # elements only in 'a'
 show a[ c["AB"] ]   # overlap 
 show b[ c["B"] ]    # elements only in 'b'
 show b[ c["BA"] ]   # overlap
 printf "The total number of unique elements is %d\n",Nof(c["A"]//c["B"]//c["AB"])

Unique(Sort(a//b))

Find atom number mapping/correspondence between two chemicals

Index( X_single_chem1, X_single_chem2 atom map ) → iarray

X_single_chem2 should be substructure or equal to the X_single_chem1.

- returns iarray of the length equal to the number of atoms in the X_single_chem2. Each element of the result array contains an atom number in X_single_chem1 which corresponds to the atom number == position_in_the_array in ~X_single_chem2

Example:


Index( Chemical("CCO"), Chemical("OCC") atom map )  # returns {3 2 1}

Indexx

function to find location of substring pattern.
Indexx ( { string | sequence }, s_Pattern )
- returns an integer value indicating the position of the s_Pattern (see pattern matching) in the string, or 0 otherwise. Allowed meta-characters are the following:

* any string including an empty string;
? any single character;
[ string ] any of the enclosed characters;
[! string ] any but the enclosed characters.
^ beginning of a string
$ string end

 
 show Indexx("asdf","s[ed.]")       # returns 2  
 show Indexx("asdfff","ff$")       # returns 5 (not 4) 
 show Indexx("asdf" "w?r")          # return 0

Insertion

Insertion ( rs_Fragment, ali_Alignment [, seq_fromAli ][, i_addFlanks ] [{"all"|"nter"|"cter"|"loop"}] )

residue selection

ali_Alignment.

seq_fromAli

rs_Fragment

seq_fromAli

linked

Deletion

i_addFlanks

"all" (or no string option) select insertions of all types
"nter" select only N-terminal fragments
"cter" select only C-terminal fragments
"loop" select only the internal loops

 
 read pdb "1phc.a/"   # read the first molecule form this pdb-file 
 read pdb "2hpd.a/"   # do the same for the second molecule 
 make sequences a_*.  # you may also read the sequence and 
                      # the alignment from a file 
 aaa=Align( )         # on-line seq. alignment. 
                      # You may read the edited alignment  
                      # worm representation 
 assign sstructure a_*. "_"  
 display ribbon 
 link a_*. aaa        # establish connection between sequences and 3D obj. 
 superimpose a_1. a_2. aaa 
 display ribbon a_*. 
 color a_1. ribbon green 
 color ribbon Insertion(a_1.1 aaa) magenta 
 color ribbon Insertion(a_2.1 aaa) red 
 show aaa

Info

Info ( [ string ] )

Info

Info ( display )

View

write object auto

write object display=yes

Info ( term [map|mmff] )

- returns the string with energy terms. E.g.

 
s_oldterms = Info(term)
..
set terms only s_oldterms

If option map is specified, ICM starts looking for m_gc, m_ge, .. etc. maps and adds a corresponding term. E.g.


s_termsAccordingToExistingMaps = Info(term map)

If option mmff is specified, ICM will select the correct set of the mmff terms.

Image details

Info ( images ) returns sarray with advanced details of images, such as their file format (JPEG, PNG, etc.), dimensions, color space (e. g. RGB, grayscale), transparency, etc.

Prediction model details

Info ( predModel )

Info ( s_builtInModelName model )

returns collection with model properties: type, weights, constant, etc..

Example


Info( "MolLogP" model )

Integer

function converting to integer type.
Integer ( l_value )
- returns 0 or 1.

Integer ( r_toBeRounded )
- returns the integer nearest to real r_toBeRounded in the direction of the prevailing rounding mode magnitude of the real argument.
Integer ( string ) - converts string into integer, ignores irrelevant tail. see also Tointeger Reports error if conversion is impossible.
Examples:

 
 show Integer(2.2), Integer(-3.1)      # 2 and -3  
 jj=Integer("256aaa")                  # jj will be equal to 256

Integral

Integral ( I | R )

returns iarray (or rarray) of the same dimension containing partial sums (from 1 to i ) of the element in the source array. E.g. Integral({2.,2.,2.}) will return 2.,4.,6.
Integral ( R r_xIncrement )

- calculates the integral rarray of the function represented by rarray Ron the periodically incremented abscissa x with the step of r_xIncrement. Note the difference between this and the above function of partial sums. The explicit increment form of the function will do the following

create (n-1) intervals (not n) ,
will always start from the 0. element
each next element will be incremented by the area under linearly interpolated broken line.

Integral({2.,4.,2.}

0.,3.,6.

Integral({2.,4.,2.})

2.,6.,8.

Integral ( R_Y R_X )

- calculates the integral rarray of the function represented by R_Y on the set of abscissa values R_X.
Examples:

 
# Let us integrate sqrt(x)  
 
 x=Rarray( 1000 0. 10. ) 
 plot x Integral( Sqrt(x) 10./1000. ) grid {0.,10.,1.,5.,0.,25.,1.,5.} display 
  
# Let us integrate x*sin(x). Note that Sin expects the argument in degrees 
    
 x=Rarray( 1000 0. 4.*Pi )              
# 1000 points in the [0.,4*Pi] interval 
 plot x Integral( x*Sin(x*180./Pi) x[2]-x[1] ) \ 
    {0., 15., 1., 5., -15., 10., 1., 5. } grid display  
# x[2]-x[1] is just the increment

    
 x=Rarray(100 ,-.9999, .9999 ) 
 x=x*x*x 
 plot display x Integral((3*x*x-1.) x) cross

Interrupt

Interrupt

yes

 
 if (Error | Interrupt) return

This method is now replaced by the setting of the interruptAction preference, e.g.


   interruptAction = "break all loops"
#or
    interruptAction = "exit macro"

Label

Label ( g ) → s

set grob

s_label

Label ( as )

make map potential m R_6box

set atom label

Label ( rs )
- returns sarray of residue labels of the selected residues rs_ composed according to the resLabelStyle preference , e.g. { "Ala 13","Gly 14"}
See also: Name function (returns residue names), and Sarray( rs [append|name|residue]) function returning selection strings.
Label ( os_objects )
- returns sarray of long names of selected objects.
See also: Name function which returns the regular object names and the most detailed chemical names of compounds.
Label ( vs_var )
- returns sarray of labels of selected variables.
Examples:

 
 build string "ala his glu lys arg asp"
 resLabelStyle = "Ala 5"  # other styles also available  
 aa = Label(a_/2:5)       # extract residue name and/or residue number info  
 show aa                  # show the created string array

Label ( T_table )

- returns iarray of table row labels (marks) set from the GUI or by set label command

Examples:


group table t {1 2 3} "A"
set label t 1 index={1,3}
Label(t)

Label ( chem chiral )

- returns sarray of chiral labels for the set of compounds.

Each element of the array may have one of the following values:

"" (empty - chiral compound)
"chiral" (chiral compound)
"racemic" (compound with one or more undefined chiral centers)

See also: set label table Index table label , Nof( X chiral [ 0|1|2.. ] )

Laplacian operator

The Laplace operator is a second order differential operator. The Laplacian of ƒ where f is defined in 3D space as map on a grid is the sum of all the unmixed second partial derivatives in the Cartesian coordinates x_i

Length

function.
Length ( { string | matrix | sequence | alignment | profile } )
- returns integer length of specified objects.
Length ( sarray )
- returns iarray with lengths of strings elements of the sarray.

Length ( X_chem1_single i_at1 i_at2 )

- returns integer length in bonds between two atoms in a single molecule

Length ( as1_single as2_single )

- returns integer length in bonds between two atoms in a single molecule (-1 if atoms are not connected)
Length ( X_chemarray link )

- returns iarray with number of bonds in a shortest path between two marked attachment points.

Length ( seqarray )

- returns iarray with lengths of sequence parray elements.

Length ( {iarray | rarray } )
- returns the real vector length (distance from the origin for a specified vector Sqrt(Sum(I[i]*I[i])) or Sqrt(Sum(R[i]*R[i])), respectively).
Examples:

 
 len=Length("asdfg")     # len is equal to 5  
 
 a=Matrix(2,4)            # two rows, four columns  
 nCol=Length(a)           # nCol is 4  
 
 read profile "prof"      # read sequence profile  
 show Length(prof)        # number of residue positions in the profile  
 
 vlen=Length({1 1 1})     # returns 1.732051

LinearModel

LinearModel( T_weights )

creates a linear regression prediction-model like: Y = 5*A + 10*B + 20The resulting model can then be applied to any table with columns required by the model.

The T_weights table should have two columns: sarray called "name" with column names, and rarray "w" with weights. It may also have a real header "b" specifying the free term (the default value is 0.).

For example, tables produced by the model weight function for other regression models may be used as input for LinearModel. So it is possible to obtain weights from a PLS model, refine or simplify them, and create a new linear regression model:


n = 1000
add column T Random(-10., 10., n) name="A"
add column T Random(-10., 10., n) name="B"
add column T Random(-10., 10., n) name="C"
add column T T.A + 10.*T.B - 5.*T.C name="Y"
learn T.Y type="plsRegression" name="Y"
Y1 = LinearModel( Table( Y term ) )
predict T Y1

A simple model example: Y = 0.7*A + 2.3*B - 10.*C + 5.6


# Build model
add column WT {"A", "B",  "C"} name="name"
add column WT {0.7, 2.3, -10.} name="w"
add header WT 5.6 name="b"
Y = LinearModel( WT )
# Predict
n = 100
add column T Random(-10., 10., n) name="A"
add column T Random(-10., 10., n) name="B"
add column T Random(-10., 10., n) name="C"
predict T Y

See also: Table model , predict , learn

Log

the logarithm function.
Log ( real ) - returns the real natural logarithm of a specified positive argument.
Log ( real r_realBase) - returns the real logarithm of a specified positive argument (e.g. the base 10 logarithm is Log(x, 10)).
Log ( rarray ) - returns an rarray of natural logarithms of the array components (they must not be negative, zeroes are treated as the least positive real number, ca. 10^-38).
Log ( rarray r_realBase ) - returns an rarray of logarithms of the array components (they must not be negative), arbitrary base.
Log ( matrix [ r_realBase ] ) - returns a matrix of logarithms of the matrix components (they must not be negative).
Examples:

 
 print Log(2.)          # prints 0.693147  
 print Log(10000, 10)    # decimal logarithm 
 print Log({1.,3.,9.}, Sqrt(3.)) # {0. 2. 4.}

Map

Map( m_map cell )

map

Map( m_map , I_6box [ simple ] ) - returns map which is a transformation (expansion or reduction) of the input m_map to new I_6box box ({ iMinX,jMinY,kMinZ,iMaxX,jMaxY,kMaxZ}). Note that the order of axes in most crystallographic is defined by the MAPS,MAPR,MAPS parameters and is not always x,y,z. The correctly ordered index is returned by the Index(

Map( m_map , as ) returns a map around selected atoms . The index box of this selection is returned by the Index( ) function
Examples:

 
 read object "crn" 
 read map "crn"   
 display a_//ca,c,n m_crn 
 m1 = Map(m_crn, {0 0 0 22 38 38})  # half of the m_crn  
 m2 = Map(m_crn, {0 0 0 88 38 38})  # double of the m_crn  
 display m1 
 display m2

make grob map

Mass

Mass( as | rs | ms | os )

 
 build string "ala his trp glu"  
 objmasses = Mass( a_*. ) 
 molmasses = Mass( a_* ) 
 resmasses = Mass( a_/* ) 
 
 masses=Mass( a_//!?vt* )   # array of masses of nonvirtual atoms  
 molweight = Mass( a_1  )[1]   # mol.weight of the 1st molecule  
 molweight = Sum(Mass( a_1//* )) # another way to calculate 1st mol. weight

Nof

sel

atom

Charge

sel

Moment

sel

Moment

Moment( as_nObj|X_n [ pca | simple | all ] )

returns an array of principal moments of inertia for the selected atoms in each selected object. The input array can also be a parray of chemicals (see Chemical ). Options:

pca (or no option) : the function returns an array with 3*nObject elements with 3 principal moments of inertia for the selection in each object
simple : the function returns an array with nObject elements with the largest principal moments of inertia (out of three) for the selection in each object
all : the function returns an array with nObject elements with the product of square roots of the principal moments of inertia for the selection in each object. For linear molecules Ixx==Iyy and Izz=0.. In that case the function returns Ixx rather than sqrt(Ixx*Iyy*Izz)

Example:


build string "ASD"
build string "G"
Moment(a_*.//* ) # three components for each object
   3470.9 # first object
   2886.5
   855.1
   167.5  # second object
   124.9
   48.4
Moment(a_*.//* simple ) # two largest moments of inertia
   3470.980225
   167.546844
Moment(a_1./2:3/ca pca ) # just two atoms: a linear molecule
   86.0
   86.0
   0.0

Match

Match( s_where s_regexp [i_field=0 [i_startPos=1]] ) → s_match

- returns the matched substring (or empty string). Example with parsing swiss id, name and description (see macro readUniprot):


id_sw  = Match(swissEntryHtmlLine, "<DT><A HREF=\"/uniprot/(.+)\">(.+)</A> \(<b>.+</b>\)<DD>(.+)" 1)  
namesw = Match(swissEntryHtmlLine, "<DT><A HREF=\"/uniprot/(.+)\">(.+)</A> \(<b>.+</b>\)<DD>(.+)" 2)
descsw = Match(swissEntryHtmlLine, "<DT><A HREF=\"/uniprot/(.+)\">(.+)</A> \(<b>.+</b>\)<DD>(.+)" 3)

minimal and greedy match Check regexp syntax for the full description of the rules. Some important hints: add question mark (?) to the end of a matching expression to make the match minimal (to the closest separator). Without '?' the match will be greedy i.e. it long for the longest match. Example:


  Match( "bla =  stuff; and more", "=\s+(.*?)\s",1) # ? for minimal
 stuff;
  Match( "bla =  stuff; and more", "=\s+(.*)\s",1) # greedy match
 stuff; and

Case sensitivity To make the match case insensitive use the "(?i)" or the "(?-i)" prefix (see also regexp syntax and simple expressions ) Example:


 s= "Some text\n Smiles  = C1CCCC1 \nmore text"
 Match(s,"(?i)smiles\s+=\s*(.+?)\s",1)    
   # ? in (.+?) means the minimal match, 1 refers to the (..) expression
 C1CCCC1

Match( all s_where s_regexp [i_field=0 [i_startPos=1] ) → S_matches

- returns an sarray with all matched expression

Match( S_where s_regexp [i_field=0 [I_startPos={1,..}]] ) → S_matches

- returns an sarray with matched substrings, the resulting array has the same size as the input array

Matrix

Matrix : create new matrix.

Matrix( i_NofRows, i_NofColumns [ r_value] ) - returns matrix of specified dimensions. All components are set to zero or r_value if specified.
Matrix( i_n [ R_n_diagonal ] )

Matrix( i_n [ R_m_row ] )

- returns square unity matrix of specified size. A matching array of diagonal values can be provided. If the array size does is not equal to i_n , a matrix with i_n rows with R_m_row values will be returned. Example:

 
Matrix(3,{1. 2. 3.}) 
 #>M 
 1. 0. 0. 
 0. 2. 0. 
 0. 0. 3. 

Matrix(3,{1. 2.}) 
#>M 
 1. 2. 
 1. 2. 
 1. 2.

Matrix( nRows [ R_row ] )

multiples R_row vector nRows times into a matrix. Make sure that nRows is not equal to Nof( R_row ) . Example: Matrix(10, {1. 2. 3.})
Matrix( rarray [ n ] ) - converts vector[1:n] to one-row matrix[1:1,1:n]. If you provide a positive integer argument, the input rarray will be divided into rows of length n. If the argument is negative, it will be split into columns of length n. Examples,

 
 Matrix({1. 2. 3. 4. 5. 6.},3) 
 #>M 
 1. 2. 3. 
 4. 5. 6. 
 Matrix({1. 2. 3. 4. 5. 6.},-3) 
 #>M 
 1. 4. 
 2. 5. 
 3. 6. 
 Matrix({1. 2. 3. 4. 5. 6.},4) 
 Error>  non-matching dimension [4] and vector size [6]

Matrix : extract a sub-matrix

Matrix( M_square i_rowFrom i_rowTo i_colFrom i_colTo ) → M

a submatrix of specified dimensions. To select only columns or rows, use zero values, e.g.

 
 Matrix( Matrix(3) 0,0,1, 2)  # first two columns

Matrix symmetrization, extraction of left and right triangles.

Matrix( M_square { left | right } ) - generate a symmetric matrix by duplicating the left or the right triangle of initial square matrix. Example:

 
icm/def> m 
#>M m 
1. 0. 0. 
0. 1. 0. 
7. 0. 7. 
icm/def> Matrix( m right ) 
#>M 
1. 0. 0. 
0. 1. 0. 
0. 0. 7. 
icm/def> Matrix( m left ) 
#>M 
1. 0. 7. 
0. 1. 0. 
7. 0. 7.

Matrix of RGB and other color characteristics

Matrix( S_nHexcolors rgb | color )

function returns a matrix of n rows and 3 columns for each of the rgb (red, green, blue) values. With option color it adds three additional columns for

intensity = 0.333 * (R+G+B)
chroma = Max(R,G,B)-Min(R,G,B)
lightness = 0.5*( Max(R,G,B)+ Min(R,G,B) )


makeColorTable  # this macro calls the Matrix( .. color ) function
#
Matrix( {"#FFFFAA","#ACBB01"} rgb )
Matrix( {"#FFFFAA","#ACBB01"} color )


makeColorTable  # create a table
add header icmColors Distance(Matrix(icmColors.Color rgb) ) name="dm"
# click on the cluster tool

Matrix of residue substitution values.

Matrix( comp_matrix s_newResOrder )

- returns comparison matrix in the specified order. Example in which we extract cysteine, alanine and arginine comparison values:

 
icm/def> Matrix(comp_matrix "CAR") 
#>M 
2.552272 0.110968 -0.488261 
0.110968 0.532648 -0.133162 
-0.488261 -0.133162 1.043102

Converting table columns into matrix

Matrix ( T [ S_colnames ] ) → M

Example:


add column t {1 2} {3 4} {4 5} # columsn .A .B .C
M  = Matrix( t )               # 3x2 matrix
mm = Matrix( t {"B","C"} )     # 2x2 matrix with .B and .C

The inverse operation is also possible with the Table ( matrix , S_colNames ) function.

Matrix resulting from a tensor product fo two vectors.

Matrix( R_A R_B )
- returns tensor product of two vectors or arbitrary dimensions: M_ij = R_A[i]*R_B[j]
Examples:

 
 mm=Matrix(2,4)          # create empty matrix with 2 rows and 4 columns 
 
 mm=Matrix(2,4,-5.)      # as above but all elements are set to -5. 
 
 show Matrix(3)          # a unit matrix [1:3,1:3] with diagonal 
                         # elements equal to 1. 
 a=Matrix({1. 3. 5. 6.}) # create one row matrix [1:1,1:4 ] 
 
 Matrix({1.,0.},{0.,1.}) # tensor product 
 #>M 
 0. 1. 
 0. 0.

Matrix of residue-residue contacts

Matrix ( rs_1 rs_2 )

- returns matrix of contact areas. See also: Cad, Area .

Matrix of inter-sequence distances

Matrix ( ali )

- returns a matrix of normalized pairwise Dayhoff evolutionary distances between the sequences in alignment ali_ (for similar sequences it is equal to the fraction mismatches).
Matrix ( ali, number ) - returns a matrix of alignment. It contains reference residue numbers for each sequence in the alignment, or -1 for the gaps. The first residue has the reference number of 0 (make sure to add 1 to access it from the shell).

Matrix from the electrostatic boundary element calculation

Matrix ( boundary ) - returns values generated by the make boundary command for each atom.

Matrix of distances between the stack conformations

Matrix ( stack ) - returns distance matrix of stack conformations according to the compare command and the vicinity parameter. Used for clustering of the stack conformations.

Matrix containing a histogram

Matrix( R_Xn R_Yn R_ruler ) - retuns 2D histogram of X and Y values. The R_ruler array consists of limits for X and Y and step sizes for X and Y and optional bin sizes: {xFrom, xTo, yFrom, yTo, [xStep, yStep] } . Returned values:

R_out : contains ruler and actual bin sizes:

Example:

 
icm/def> Matrix(Random(0. 5. 20) Random(0. 5. 20) {0. 5. 0. 5. 1. 1.}) 
#>M 
1. 0. 2. 1. 0. 
1. 1. 1. 1. 2. 
0. 2. 1. 1. 0. 
0. 1. 0. 0. 1. 
1. 0. 0. 1. 2.

Connectivity Matrix for Grob Vertices

Matrix( grob wire ) → M_one_or_large_number

Returns a matrix n_vertices by n_vertices containing 1. for connected vertices and a large number for unconnected. Example:

16777216. 1. 1. 16777216. 1. 16777216. 16777216. 16777216.

1. 16777216. 16777216. 1. 16777216. 1. 16777216. 16777216.

1. 16777216. 16777216. 1. 16777216. 16777216. 1. 16777216.

16777216. 1. 1. 16777216. 16777216. 16777216. 16777216. 1.

1. 16777216. 16777216. 16777216. 16777216. 1. 1. 16777216.

16777216. 1. 16777216. 16777216. 1. 16777216. 16777216. 1.

16777216. 16777216. 1. 16777216. 1. 16777216. 16777216. 1.

16777216. 16777216. 16777216. 1. 16777216. 1. 1. 16777216.

Max

[ Max image graphic ]

Max ( { rarray | map } )

real

Max ( iarray )

Max ( sarray ) - returns the string longest common prefix of elements of sarray.

Max ( R1_n R2_n ) → R_max_n - returns the rarray of maximal values.
Max ( index { iarray | rarray } ) - returns the integer index of the maximum-value element of the array (or one of them if many).

Max ( clusterObject ) - returns maximal distance of the root node


cl = Split( t.cluster, Max( t.cluster )/2 )

Max ( index { iarray | rarray } group I_clusterNumbers ) - returns the iarray of indices of maximal values, e.g.


Max(index { 1. 3. 1. 2. 5.} group { 1 2 1 2 2} )
 #>I
   1
   4   # the maximal element 2. has index 4

Min

group ..

Max ( matrix )

Max(Max(

Max ( matrix_nm matrix_nm ) → M_max_nm - returns the matrix with the larger values of the two input matrices of the same dimensions.

Max ( integer1, integer2, ... ) - returns the largest integer argument.
Max ( real1, real2, ... ) - returns the largest real argument.
Max ( S s_leadingString ) returns the maximal trailing number in array elements consisting of the s_leadingString and a number. If there are no numbers, returns 0. E.g. Max({"a1","a3","a5"},"a") returns 5.
Max( *grob | *macro | *sequence | *alignment | *profile | *table | *map )
returns the maximal number of shell objects of the specified class. To increase this shell limit, modify the icm.cfg file.
Max ( grob s_leadingString ) returns the maximal number appended to grob names:

 
  g_skin_1 
  g_skin_2

 
 show Max({2. 4. 7. 4.})          # 7. will be shown

Recommended setting for GRAPHICS.quality in images

Max( image graphic ) - returns the recommended value of GRAPHICS.quality to be used with commands which generate images.

Example:


write image memory GRAPHICS.quality=Max(image graphic)

MaxHKL

an array of three maximal crystallographic h,k,l indices at a given resolution.
MaxHKL( { map | os | [ R_6CellParameters ] }, r_minResolution ) → I_3hkl_limits
the function extracts the cell parameters from map_ , os_ object, or reall array of {a,b,c,alpha,beta,gamma}, and calculates an iarray of three maximal crystallographic indices { hMax , kMax , lMax } corresponding to the specified r_minResolution .

Median

median-value function.

Median ( { rarray | iarray } )
- returns the real median-value of elements of the specified ICM-shell objects.

Examples:

 
 print Median(Count(100))          # returns 50.5

Mean

average-value function.
Mean ( { rarray | map } )
- returns the real average-value of elements of the specified ICM-shell objects.
Mean ( iarray )
- returns the real average-value of the elements of the iarray.
Mean ( matrix )
- returns rarray [1:m] of average values for each i-th column matrix[1:n,i].
Mean ( R1 R2 )
- for two real arrays of the same size returns rarray [1:m] of average values for each pair of corresponding elements.
Examples:

 
 print Mean({1,2,3})          # returns 2.  
 
 show Mean(Xyz(a_2/2:8))      # shows {x y z} vector of geometric  
                              # center of the selected atoms  
 Mean({1. 2. 3.} {2. 3. 4.}) 
 #>R 
   1.5 
   2.5 
   3.5

Min

Min ( { rarray | map } )

real

Min ( index { iarray | rarray } )

Min ( index { iarray | rarray } group I_clusterNumbers )


Min( index { 1. 3. 1. 2. 5.} group { 1 2 1 2 2} )
 #>I
   1
   4   # the minimal element 2. has index 4

Max

group ..

Min ( iarray ) - returns the integer minimum-value element of the iarray.

Min ( R1_n R2_n ) → R_min_n - returns the rarray of minimum values.
Min ( matrix ) - returns the rarray of minimum-value element of each column of the matrix. To find the minimum value use the function twice (e.g. Min(Min(m)) )

Min ( matrix_nm matrix_nm ) → M_min_nm - returns the matrix with the smaller values of the two input matrices of the same dimensions.

Min ( integer1, integer2 ... ) - returns the smallest integer argument.
Min ( real1, real2, ... ) - returns the smallest real argument.
Examples:

 
 show Min({2. 4. 7. 4.})      # 2. will be shown  
 show Min(2., 4., 7., 4.)     # 2. will be shown

Min ( alignment, sequence ) → i_nearestSeq

- returns the integer index of the nearest sequence in the alignment.

To get the name of the nearest sequence, use the Name function. Example:


read alignment s_icmhome+"sh3"
b = Sequence("KKYAKAKYDFVARNSSELSMKDDVLELILDD") # like Eps8 seq
iseq = Min(sh3, b)   # returns 3.
nam  = Name(sh3)[iseq] # "Eps8" is the closest sequence
show $nam

Money

Money ( { i_amount | r_amount}, [ s_format] )

s_format

%m specification for the rounded integral amount;
%.m specification to add cents after dot. The default is "$%.m", i.e. Money(1222.33) returns $1,222.33.
%M the same as %m but with dot instead of comma in the European style
%.M the same as %.m dot and comma are inverted in the European style

 
 Money(1452.39)  # returns "$1,452.39"  
 Money(1452.39,"DM %m")  # returns "DM 1,452"  
 Money(1452.39,"%.M FF")  # inverts comma and dot "1.452,39 FF"

Mod

Remainder

function	description	example
Mod(x,y)	brings x to [0 , y] range	Mod(17.,10.) → 7.
Remainder(x,y)	brings x to [-y/2 , y/2] range	Remainder(17.,10.) → -3.

Mod ( i_divisor, [ i_divider ] )

Mod ( r_divisor, [ r_divider ] )

real

Mod ( iarray, [ i_divider ] )

Mod ( rarray, [ r_divider ] )

 
 phi = Mod(phi)    # transform angle to [0., 360.] range 
 a   = Mod(17,10)  # returns 7

Mol

Mol ( { os | rs | as } )

molecules

os_

rs_

as_

Note that there is an obsolete Mol function to return a mol/sdf formatted string . The up-to-date version of this function is String( X )
Examples:

 
 show Mol( Sphere(a_1//* 4.) )      
           # molecules within a 4 A vicinity of the first one  
           # Sphere function Sphere(as_atoms) selects atoms.

Atom

Res

Obj

Name

Name ( )

Name ( s|S_Path_and_Name ) → s_name|S_name

Path

Name( s_hint [ simple | unique | object ] )

simple : removes non-alpha-numeric symbols from a string and replaces them by underscore.
object : finds a name for a new object starting from s_hint and ensures its uniqueness
unique : checks if the name s_ exists in the ICM-shell. If the name does not exist, it is returned without changes, otherwise a number is appended to the name to guarantee its uniqueness.

 
  Name( " %^23 a 2,3 xreno-77-butadien" simple)     
 23_a_2_3_xreno_77_butadien 
 
  a=1 
  Name("a",unique) 
 a1

Unique molecular object names and unique molecule names in a given object

Name( s_obj_name_hint object unique )

Name( "target", object, unique )

Name( s_mol_name_hint os1_object unique ) returns a unique name for a new molecule in existing object os1_object , e.g. Name( "a", a_2. unique) will return "a" or "a1" if a_2.a already exists

Name of the shell variable

Name( variable any_shell_variable )

- returns a string with a name of a provided shell variable.

Example:


add column t {1 2 3} name="A"
Name( variable t )
Name( variable t.A )

All names of objects in a given class Name ( className ) → S_names

- returns a string array of object names for the specified class. Classes: command,function,macro,integer,real,string,logical,iarray,rarray,sarray,matrix,map,grob,alignment,table,profile,sequence

Subclass of strings: html-objects and scripts Name( string [ html | command ] ) returns the list of html documents or scripts in ICM shell
Name ( { iarray | rarray | sarray | matrix | map | grob | alignment | table | profile | sequence | chemical | reaction | slide | model | tree } selection ) - returns a string array of names of selected objects for the specified class.
Name ( as [full] ) - returns sarray of names of selected atoms,residues,molecules or objects. With option full it returns sarray of the selection expressions (e.g. {'1abc.a/13/ca'} ), one item per array element. Name( .. full ) function is often used to form columns of clickable cells. Example:


read pdb "1zzz"//"3zzz"
add column t Name(a_*.H full) name="A"
set format t.A  "<!--icmscript name=\"1\"\ndsSelection \"%1\" --><a href=#_>%1</a>"

See also: String( as ); Sarray( as )

Name ( as|rs|ms|os field ) - returns sarray of unique names of assigned tags (fields), see also set field name .
Name ( as sequence ) - returns sarray of chemical names of the selected atoms according to the icm.cod file (one-letter chemical atom names are low case, e.g. "c", two-letter names start from an upper-case letter, like "Ca"). The names from the periodic table are used in the wrGaussian macro.
Name ( rs ) - returns sarray of names of selected residues. To obtain a one-letter code sequence, use Sequence( rs_ ) and to convert it to a string use String( Sequence( rs_ )) .
Name ( ms ) - returns sarray of names of selected molecules.

Name ( ms chain ) - returns sarray of chain names of selected molecules.

Name ( chem_array ) - returns sarray of names of chemicals in an array ( see also )
Name ( ms sequence ) - returns sarray of names of sequence linked to the specified molecules ms_ or empty strings.
Name ( ms alignment ) - returns sarray of names of alignments linked to the specified molecules ms_ or empty strings
Name ( ms swiss ) - returns sarray of names of swissprot names corresponding to the specified molecules ms_ or empty strings . See also the set swiss command.
Name( os ) - returns sarray of names of selected objects. E.g. Name( a_ )[1] returns string with the name of the current object.
Name ( vs ) - returns sarray of names of selected variables.
Name ( alignment ) - returns sarray of constituent sequence names.
Name ( table ) - returns sarray or constituent table ICM-shell object names.
Name ( sequence ) - returns string name of specified sequence.

Name( T column ) - returns sarray of column names

Name( T header ) - returns sarray of header names

Name( T selection ) - returns sarray of selected (in GUI) column names

Name( collection ) - returns sarray of keys of the collection

Name( collection s_filter ) - returns sarray of keys of the collection which satisfy the s_filter expression. s_filter can be any logical expression which operates with key or value or both.

Example:


c = Collection( "a" yes, "b" no, "c" yes )
Name( c "value==yes"  )  # returns only "a" and "c"
Name( c "value"  )  # the same as above
Name( c "!value"  )  # "b"

Name( gui {html|table|alignment} )

- returns sarray of shell objects in the order of their tabs appear in the GUI. Notice that the order of tabs corresponding to html-documents, tables or alignments can be changed with drag and drop. It will lead to a different order retuned by the Name function.

Name( foreground {html|table|alignment|slide} ) -returns name of the currently active object (the active tab) in the class.

Examples:

 
 read alignment msf s_icmhome+"azurins"        # load alignment  
 seqnames = Name(azurins)            # extract sequence names  
 
 show Name( Acc( a_/* ) ) # array of names of exposed residues

Name( chemical property ) function

Name( chemical property )

- returns sarray of names of loaded descriptors/models (e.g. "MolLogP") for chemicals.

List name of the fields in SOAP struct object

Name( soapStruct )

See also: SOAP services for further information.

Find the name of the closest sequence in an alignment

Name( ali seq ) → s_nameOfTheClosestSequence

Example:


read alignment s_icmhome+"sh3" # alignment
readUniprotWeb "FYN_HUMAN" 
Name(sh3,FYN_HUMAN) # returns "Fyn"

Name( string .. ) function

Name( string html ) - returns sarray with the names of all the HTML objects in the project

Name( string command ) - returns sarray with the names of all the scripts in the project

Name( tree .. ) function

Name( tree-parray i_parrayIndex [index|label|matrix|sort|split] ) - returns string names of different properties of the tree cluster object. The following names can be returned:

no argument, e.g. Name( T.cluster 1 ) returns the tree name shown in GUI
index : returns the name of a table column containing the unique index number of each row in the order of the data tree (compare with the split option which returns the branch number).
```
a = "T."+Name(T.cluster 1 index)
sort $a  # sorts rows in the tree order
```
label : returns the format string of the tree node label, e.g. "%NAME(%ID)", referring to table columns called t.Name and t.ID. The names need to end with a separator or semicolumn.
matrix : if the distance matrix was used when making a tree, and this matrix was attached to the table header, the shell name of this distance matrix is returned. Example:
```
read table "t.tab"
make tree t matrix "upgma" # attaches distance matrix to the table header
show Name(t.cluster 1 matrix)
```
sort : returns the column name which was used to additionally order the data tree during the tree construction.
split : returns the column name in which the branch order number (at a certain fixed split level) is stored.

Names of chemicals

Name( chemarray )

- returns sarray of names of chemarray.

Note that function does not generate a systematic (IUPAC) name. It uses names from the first line of SD/MOL file. Chemical names can also be set with set name command.

See also: set name other chemical functions

Comments of the stack confomations

Name( conf ) → comments for the global stack

Name( os conf ) → comments for the embedded stack of the object

Names of sequences

Name( seq_parray ) → S_names

- returns sarray with stored names of sequence parray elements

See also: set name sequence

Name or fingerprint chain description

Name( predModel column )

- returns sarray with column names or chemical fingerprint chain information.

The result of this function can be used for analysis of the prediction model results and can used together with Descriptor function.

Each element in the fingerprint part is SMARTS-like expression (some atom properties used in the prediction model cannot be expressed as a valid SMARTS expression)

For the default atom properties the output will look like this:


Name( myModel column )
#>S string_array
[#6;H3]
[#6;H2]-[#6;H3]
[#6;H2]
...

Covalent neighbors of an atom

Next ( as { bond | tree } )

tree

above

Example of a test if a hetatm molecule is covalently attached to a polymer:

Expand by covalent bonds by one bond
Count the number of molecules excluding the source molecule (ms1) itself
if it propagated to other molecules Nof will return a number larger than 0


 read pdb "2vsd"
 ms1 = a_a2
 l_cov_attached = Nof( Mol(Next( ms1 bond)) & !(ms1) ) > 0

 
 build string "his"
 display 
 display a_/his/he2 ball red 
 display Next( a_/his/he2 bond ) ball magenta # show atom preceding he2 
 
 cd2_neigh = Next( a_//cd2 bond ) 
 for i=1,Nof(cd2_neigh) 
   nei = cd2_neigh[i] 
   print "  Distance between a_//cd2 and ",Sum(Name(nei)), " = ", Distance( a_//cd2 nei) 
 endfor

Nof

Nof( X .. )

Nof( P_distArray, distance ) → i_nofDistances # see make distance

Nof( X, s|S_smarts ) → I|M_nOfChemMatches

Nof( X, chiral, {0|1|2|3} ) → nChiralCenters of particular type

Nof( X, library ) → enumeration_library_size

Nof( site rs_|seq_ ) → i_nSites

Nof( model vector ) → i_nLatentVectors

Nof( drestraint|residue|vrestraint type ) → i

Nof( as {bond [error] | selftether [error] } ) → i

Nof( rs|ms {atom | tautomer } ) → I #in each res. or mol.

Nof( stack ) → the number of stack atoms

Nof( os_1, "bio" ) → nBiomol # see: Select|Transform(os "bio" i)

Nof( [os_1] stack ) → the number of stack confs

Nof( fork ) → the number of CPU cores available

Nof( map grid ) → the number of 4D layers in map

Nof( g {1|2|3} ) → nVertex|nLines|nTriangles

Nof( s s_substring [pattern] ) → nMatches

Nof( s_file.ob|.cnf ) → number of objects in .ob, or conformations in .cnf files

Nof( tree [i_at=1|all] ) → nEntries

Nof( tree [i_at=1] tree|auto ) → nClusters|nSuggestedClusters

Nof( idb s_indexedIDBfile ) → nRecords

Nof( S|I s|i ) → i_nMatched_el

Nof( S s regexp ) → I_nMatches_in_each

Nof( s_table_name sql [s_connectionID] ) → nRows

Nof( s_table_name molcart unique ) → nUniqueChemicals
Nof ( className )

- returns integer number of objects in a class (e.g. Nof(sequence) ). Classes: iarray,rarray,sarray,sequence,aselection,vselection,alignment,matrix,map,grob,string,object

Nof ( { iarray | rarray | sarray | chemarray | parray } ) - returns integer number of elements in an array. Note that distanceParrays or hbondParrays returned by the make distance of make hbond commands have a two-level structure in which the actual list of bonds or distances is the nested to the main level of this parray. Therefore to get the number of distances or hbonds one needs to use the following function.

Nof ( hbondChunkArray|distChunkArray distance ) - returns the total number of nested atom pairs.

Nof ( ali ) - returns integer number of sequences in a specified alignment ali_ (see also Length( alignment ) ).
Nof ( matrix ) - returns integer number of rows in a matrix (see also Length( matrix) function which returns number of columns).
Nof ( table ) - returns integer number of number of rows in a table

Nof ( map ) - returns integer number of grid points in a map.
Nof ( grob ) - returns integer number of points in graphics object.
Nof( { os | ms | rs | as | vs } ) - returns integer number of selected objects, molecules, residues, atoms or variables respectively.

Nof( { as on|off | bond [error] | selftether [error] ) - returns integer number of atoms that are hidden ( off ) or present ( on ). See also set as on | off . Also counts bonds, bonds and selftethers

Nof( { rs|ms atom | tautomer ) - returns iarray with number of atoms or tautomers in each residue or molecule. Depends on selection levels.

Nof ( { atoms | residues | molecules | objects | conf | stack | tether | vrestraint } ) - returns the total integer number things.

atoms : same as Nof(a_*.//*) , except that Nof(atoms) will work even if the object does not exist.
residues : same as Nof(a_*.*/*)
molecules : same as Nof(a_*.*)
objects : same as Nof(a_*.)
conf : number of confomations (`conf) in the global stack
stack : number of atoms in an object which was used to create a stack , e.g. Nof( a_2. stack )
tether : the total number of tethers in the current object, same as a_//T
vrestraint

Nof ( fork )

fork

wait

Nof ( library ) - returns 1 if the force field parameter library is loaded and 0 otherwise.
Nof ( library ) - returns 1 if the mmff library is loaded and 0 otherwise.
Nof ( plane ) - returns the number of active graphical planes
Nof ( site [ ms | seq ] ) - returns integer number of sites in the selected molecule or the current object or sequences.

Nof ( os1 stack ) - returns integer number of conformations in a built-in stack of a specified object.
Nof ( s_objFileName.ob ) or Nof( s_stackFileName.cnf ) - returns integer number of conformations in an object file, or conf in a stack conformation file.
Example:

 
 nObj =  Nof("target.ob")
 if(nObj >=2) read object "target.ob" number=2

 for i=1,Nof("def.cnf")  # stack is NOT loaded 
   read conf i "def.cnf"
 endfor

Nof ( os_singleObj stack ) - returns integer number of conformations in the object stack. Note that stack stored in object is not the same as the global shared stack. E.g.


build string "HWEH"
montecarlo store # creates stack and stores it in object
Nof(a_ stack)  # returns the number of conformations in object stack

Nof ( string, substring )

substring

string.

Nof("ababab","ba")

2

Nof ( string, substring, pattern )

find database pattern=s_pattern

Nof("ababab","b?",pattern)

2

dn1:

 
 if(Nof(String(dn1),"[!ACGT]" pattern) > 0.5*Length(dn1)) print " Warning> Bad DNA sequence"

Nof ( className selection )

- returns integer number of selected ICM-shell variables. This selection does not work the following types: aselection, vselection, string , object .
Examples:

 
 nseq = Nof(sequences)         # number of sequences currently loaded  
 if(Nof(object)==0) return error "No objects loaded" 
 
 if ( Nof( sequence selection ) == 2 ) a = Align( selection )

Nof ( {table|alignment|grob} display )

- returns integer number of displayed ICM-shell variables.


Nof( grob display )   # number of meshes displayed in 3D
Nof( table display )  # number of spreadsheets visible in GUI

Counting clusters

Nof ( tree [i_index=1] ) - returns integer number of entries in the cluster tree.

Nof ( tree [i_index=1] tree ) - returns integer number of clusters at current split level

Nof ( tree [i_index=1] auto ) - returns an integer guess for a recommended number of clusters

Counting various properties/patterns in chemical arrays.

Nof( chemarray, "ring" ) - returns an iarray containing the number of rings in each array element.


 show Nof(Chemical("C1CC2CC1CCC2") "ring" )

Nof( chemarray, "minRing" ) - returns an iarray of max ring sizes

Nof( chemarray, "maxRing" ) - returns an iarray of min ring sizes

Nof( chemarray, chiral [ 1|2|3 ] ) - returns the number of chiral or racemic centers as follows:

any chiral or racemic center : the default, e.g. Nof(t.mol chiral)
R centers: type = 1 ,
S centers: type = 2
RS or racemic centers : type =3.

add column t Nof(t.mol chiral 3) "nRacemicCenters"

Occupancy

Occupancy ( { as | rs } )

rarray

set occupancy

 
 read object s_icmhome+"crn.ob"
 avO=Min(Occupancy(a_//ca))     # minimal occupancy of Ca-atoms  
 show Occupancy(a_//!h*)        # array of occupancy of heavy atoms  
 color a_//* Occupancy(a_//*)   # color previously displayed atoms  
                                # according to their occupancy 
 color ribbon a_/A Occupancy(a_/A) # color residues by mean occupancy

Path

Path ( )

Path(directory)

Path ( s|S_FullFileName ) - returns header sub- string (or sarray) with the path(s). Example: Path("a/b/c/dd.icb") returns "a/b/c/" See also Name and Ext
Path ( s_FullFileName full ) - returns the full file name including absolute directory, and the filename with extension
Examples:

 
 sPath=Path("/usr/mitnick/hacker.loot") # returns "/usr/mitnick/"  
 Path("~/.cshrc" full )
 /home/crepe/.cshrc

Name

Extension

Path ( indexTable ) → s_sourceFile

returns the string path to the source data file for the indexTable . The full name is returned by the File function. Example:


write index mol "/data/chem/nci.sdf" "./nci.inx"
read index "./nci.inx"
Path(nci)  # returns location of the source nci.sdf file
 /data/chem/

write index

File

T_indexTable

Path ( origin [ S|s_script_with_args [S_args] ] )
- returns a path to the ICM executable and optional arguments. It is useful when you want to call a named icm-script with arguments in a multi-platform compatible way. ICM binary can also be found in the Version(full) string.

Example in which Path finds icm executable runs the script with it:


  Path(origin "myicmscript.icm file.icb -v -max=2.3")  
 /pro/icm/icm/icm  myicmscript.icm file.icb -v -max=2.3

  Path(origin, "myicmscript.icm",{"file.icb", "-v", "-max=2.3"})  
 /pro/icm/icm/icm  myicmscript.icm file.icb -v -max=2.3

  Path(origin, {"myicmscript.icm","file.icb", "-v", "-max=2.3"})  
 /pro/icm/icm/icm  myicmscript.icm file.icb -v -max=2.3

Versions before 3.5-2 used Path(unix,[]) or Path(macro,[..]) syntax.
Path ( directory )
- returns the current working directory.
Path ( last )
- returns the path of the last icm-shell script called by ICM.

Path ( preference )
function obsolete

Note, that the current version of ICM stores user preferences in the ~/.config/Molsoft.conf file under Linux.)

Path ( s_somePath fix )

- returns a string with simplified separators (useful when you want to compare different paths)

Example:

 
Path("/home/"+"/"+"theuser//" fix ) # == "/home/theuser/"

Parray

function returning an array or 'pointers' to various types of data-objects in ICM, for example an array of chemical compounds. The data-objects types of an element (one cell) may include: chemical, image, grob, object, sequence, sarray, iarray or rarray, and collection.

create an empty parray of the specified cell/element-type. To store images, use the Image function, to store logicals see below. Extracting the parray elements back to shell objects can be done with one of three methods:

direct assignment: e.g. a=t.A[1] : works for i|r|sarrays , sequences
assignment via a type function: e.g. a=Grob(t.A[1]) : works for grobs, images (see Image), maps,
load object : e.g. load object t.A[1] name='x' : works for molecular objects

Parray ( s_smiles smiles )

Parray ( s_molFileText mol )

Parray ( matrix ) add a column of row-arrays. E.g.


add column t Parray(Matrix(10)) 
set property plot t.A

Parray ( model s_modelName ) - returns an empty parray of type model It has two reserved fields "type" (set to "Custom" ) and "dim" . This object also behaves as a collection which can hold additional named elements.

Parray ( object ) - returns an object parray containg all ICM molecular objects loaded

Parray ( object os [stack] ) - returns an object parray of ICM molecular objects from the object selection. If stack keyword is specified, the current stack is stored into the object.

Parray ( sequence rs ) - returns a sequence parray of size 1 containing the residues specified.

Parray ( sequence [selection] ) - returns a sequence parray containing all sequences loaded into ICM (with the selection option only the GUI-selected ones).

Parray ( sequence|object i_n ) - returns a sequence parray or object parray containing i_n empty objects

Example:

 
read table mol "ex_mol.mol" name="t" 
s  = String(t.mol[1]) # sss contains mol/sdf text 
t.mol[1] = Parray(s mol)  # sss is parced and converted

Pattern

Pattern ( { s_consensus | ali } [ exact ] )

sequence pattern

find pattern

s_consensus

pattern

"R+. ..^D"

"R[KR]?\{3,6\}[ACGS]D"

ali_

exact

A

V

[AV]

exact

[AFILMPVW]

exact

 
 read sequence s_icmhome + "zincFing" 
 group sequence aaa 
 align aaa 
 show Pattern("#~???A%  ?P")  # symbols from consensus string 
 show Pattern(aaa) 
 show Pattern(aaa exact)

Pattern ( s_seqPattern prosite | residue )

prosite

s_seqPattern

pattern

Pattern ( rs ) → s_res_barcode - returns string "barcode" with selected residues followed by the length of the intervening gaps. This function can be applied with the 'B' and 'Q' residue selections. E.g.


read pdb "1xbb:
Pattern(Res( Sphere( a_H a_A -1.1)))
 A47ME1AE1G46L
a_*.*/BA47ME1AE1G46L

Pattern ( seq disulfide ) → s_Cys_pattern - e.g. Pattern(1crn_a) returns "C??C???...C"-style pattern

Pi

Pi

real

3.14...).

 
 print Pi/2.

Potential

Potential ( as_targets as_charges )

How to evaluate the pK shift

as_targets)

as_targets

as_charges

REBEL

make boundary

 
 read object s_icmhome+"crn" 
            # prepare electrostatic boundary descriptions 
 make boundary  
            # potential from oe*, od* at cz of two args 
 show Potential(a_/arg/cz a_/glu,asp/o?* ) 
 print 0.5*Charge(a_//*)*Potential(a_//* a_//* )  
            # the total electrostatic energy which is 
            # actually calculated directly by show energy "el"

Power

Power ( r_base, { r_exp | i_exp} )

real

r_base^r_exp

r_base^i_exp.

r_base

Power ( r_base, R_exp )

r_base

R_exp

Power ( R_base, r_exp ) - returns rarray with each of the R_base elements taken to the r_exp power.
Power ( R_base, R_exp ) - returns rarray with R_exp powers of the according R_base elements. Input arrays should have the same size.

Power ( r_base, M_exp ) - returns matrix of the r_base taken to the M_exp powers.
Example:

 
 Power(2.,{1. 2. 3.}) # returns {2.,4.,8.}

Power ( rarray, r_Exponent )

Power ( matrix, integer )

n-th

inverse

 
 size=Power(tot_volume,1./3.)        # cubic root  
 
 read matrix "LinearEquationsMatrix" # read matrix [1:n,1:n] 
 read rarray b                       # read the right-hand column [1:n] 
 x=Power(LinEquationsMat,-1) * b     # solve system of linear equations 
 
 a=Rot({0. 1. 0.}, 90.0)                 
       # create rotation matrix around Y axis by 90 degrees  
 if (Power(a,-1) != Transpose(a)) print "Wrong rotation matrix" 
                            # the inverse should be  
                            # equal to the transposed  
 rotate a_1 Power(a,3)      # a-matrix to the third is  
                            # three consecutive rotations

Profile

Profile ( chem i_at1 i_at2 i_at3 i_at4 )

Profile ( chem selection )

Profile ( v_tor {group|randomize} )

Returns torsion statistical profile predicted by graph Convolutional Neural Network (GCNN) trained on structures from Crystallography Open Database (COD) J Chem Inf Model. 2022 Dec 12;62(23):5896-5906. doi: 10.1021/acs.jcim.2c00790. Epub 2022 Dec 1.

Profile ( alignment ) - creates profile from an alignment

Property

Property ( grob option )

grob

Available options:

full -- dual in-out lighting of the grob
surface -- whether only the front face of the triangles is shown (back faces culled)
texture -- allow textures
material -- allow materials
grid -- (requires texture). Use only the fractional part of the texture coordinates (allows repetitive pattern textures)
heavy -- grob lids on sliced surfaces

set property

Putarg

Putarg( s_name s_value )

- adds a name-value pair to the list of ICM arguments. Returns no in case of error.

See Getarg .

Putenv

function to change or add value to environment.

Putenv ( " s_environmentName = s_environmentValue " )

-returns a logical yes if the named shell-environment variable is created or modified.
Putenv ( )

- function to push the icm or icm-script arguments (see Getarg ) into the unix shell as shell arguments. Returns the number of set variables. To eliminate an agrument from the list, use the Getenv ( s_argName delete ) function.

Examples:

 
show Putenv("aaa=bbb") # change/add variable 'aaa' with value 'bbb' to environment 
show Getenv("aaa")     # check if it has been successful

See also: Existenv, Getenv, Getarg.

Radius

Radius ( as )

real array

(skin)

icm.vwt

Radius ( as ball ) - returns the rarray of the graphical radii of the xstick or ball representation. Normally they are defined by the GRAPHICS.stickRadius parameter and GRAPHICS.ballStickRatio . They can be set to custom values with the set atom
Radius ( as surface ) - returns the rarray of the 'hydration' atomic radii.
These radii are used in construction of the solvent-accessible (surface) and can be found (and possibly redefined) in the icm.hdt file.
Radius ( as charge ) - returns the rarray of the 'electrostatic' atomic radii.
These radii are used for building the skin (analytical molecular surface) for electrostatic dielectric boundary calculation with electroMethod = "boundary element". These parameters can be found (and possibly redefined) in the icm.vwt file.

Random

[ Random string ]

Random ( )

real

Random ( i_max )

i_max

Random ( i_min , i_max )

i_min

i_max

Random ( r_min , r_max )

real

r_min

r_max

Random ( r_min , r_max , i_n )

i_n

r_min

r_max

Random ( r_mean , r_std , i_n , "gauss" )

i_n

Random ( r_min , r_max , i_nRows , i_nColumns )

i_nRows

i_nColumns]

r_min

r_max]

Random ( i_nRows, i_nColumns, r_min, r_max )

i_nRows

i_nColumns]

r_min

r_max

 
 print Random(5)            # one of the following: 1 2 3 4 or 5  
 print Random(2,5)          # one of the following: 2 3 4 or 5  
 print Random(2.,5.)        # random real in [2.,5.]  
 randVec=Random(-1.,1.,3)   # random 3-vector with components in [-1. 1.]  
 randVec=Random(3,-1.,1.)   # the same as the previous command  
 randMat=Random(-1.,1.,3,3) # random 3x3 matrix with components in [-1. 1.]  
 randMat=Random(3,3,-1.,1.) # the same as the previous command   
 Random(0., 1., 10, "gauss" ) # normal distribution

Random strings

Random( I_lengths s_alphabet ) - returns an sarray of random strings of lengths specified in the I_lengths array. Strings are comprised from the characters specified in the s_alphabet. Alphabet specifications are the same as character set specifications in regular expressions: "A-Z", "\\w", "\\dA-Fa-f", "ACGT".

Random( i_n S_words ) - returns an i_n element sarray consisting of the words specified in the S_words array repeated in random order.

Examples:


Random( Iarray(10,20) "\\dA-Z" ) # returns 10-element sarray with 20-character strings containing random digits and capital letters
Random( 100, {"rock","paper","scissors"} ) # returns 100-element sarray consisting of words "rock", "paper" and "scissors" in random order
Random( 100, Random( Iarray(10,3) "a-z" ) ) # returns 100-element sarray containing 10 random words from the "a-z" alphabet

Rarray

[ rarray sequence projection | Rarrayinverse | R property transfer via alignment | Rarray properties | RarrayAlignment ]

Rarray ( i_NofElements )

i_NofElements].

Rarray(0)

Rarray ( i_NofElements, r_Value )

i_NofElements

r_Value.

Rarray ( i_NofElements, r_From, r_To )

i_NofElements

r_From

r_To.

Rarray ( r_From, r_To , r_step )

r_From

r_To.


  Rarray( 3.1, 15. 0.1)

Rarray ( iarray )

iarray

Rarray ( sarray )

sarray

Rarray ( sarray s_patternForValue1 ) - converts sarray into a rarray of 1. and 0. The value is 1. if an element if an array matches the string. E.g. Rarray({"M","W","M","E","W"},"M") # returns {1. 0. 1. 0. 0.}

Rarray ( R n_significantDigits ) → R_rounded - rounds the input array to the specified number of significant digits. If n is out of bounds ( less than zero or more than 12, the function switches to the default of 2.
Rarray ( M [ i_flag ] ) - extracts different groups of elements of the matrix and casts them into a rarray. There are seven (7) possibilities for a matrix n rows by m columns:

all elements by rows (the default) , n*m
all elements by columns, n*m
the upper triangle with diagonal, n(n+1)/2
the lower triangle with diagonal, n(n+1)/2
the diagonal elements, n
the upper triangle without diagonal elements, n(n-1)/2
the lower triangle without diagonal elements, n(n-1)/2

 
 a=Rarray(54)                     # create 54-th dimensional vector of zeros 
 a=Rarray(3,-1.)                  # create vector {-1.,-1.,-1.} 
 a=Rarray(5,1.,3.)                # create vector {1., 1.5, 2., 2.5, 3.} 
 a=Rarray({1 2 3})                # create vector {1. 2. 3.} 
 a=Rarray({"1.5" "2" "-3.91"})    # create vector {1.5, 2., -3.91} 
# 
 M=Matrix(2);M[2,2]=2;M[1,2]=3 
 Rarray( M ) 
 Rarray( M 1 ) 
 Rarray( M 2 ) 
 Rarray( M 3 ) 
 Rarray( M 4 ) 
 Rarray( M 5 ) 
 Rarray( M 6 ) 
 Rarray( M 7 )

rarray sequence projection

Rarray ( R_ali ali_from { seq | i_seqNumber } )

R_ali

ali_from.

seq_

R_ali

seq_.

Projecting from one sequence to another sequence via alignment.

seq1

seq2

ali

seq1

seq2

two transfers

seq1 to ali : RA = Rarray( R1 seq1 ali r_gapValues )
ali to seq2 : R2 = Rarray( RA ali seq2 )

seq1

r_gapValues

`String`( s_,R_,ali_,seq_ )	function to project strings
`Rarray`( R_seq seq_ ali_to r_gapDefault )	function to project from sequence to alignment
`Probability`()	function

Reversing the order of elements in an array

Rarray ( rarray reverse )

 
Rarray({1. 2. 3.} reverse)  # returns {3. 2. 1.}

Transfer real sequence properties by alignment

Rarray ( R_seq { seq | i_seqNumber } ali_to r_gapDefault )

seq_

ali_to

R_seq

seq_.

r_gapDefault

 
read alignment s_icmhome+"sh3" 
t = Table(sh3) 
group table t Count(Nof(t)) "n" append  # add a column with 1,2,3,.. 
show t  # t looks like this: 
 #>T t 
 #>-cons--------Fyn---------Spec--------Eps8-------n--- 
    " "         0           1           1          1 
    " "         0           2           2          2 
    " "         0           3           3          3 
    " "         0           4           4          4 
    .           1           5           5          5 
 ... 
t2forFyn =  t.Fyn == 2   # table row for position 2 in seq. Fyn 
t2forFyn.n               # corresponding alignment position

String

Assign arbitrary amino-acid property to a sequence

Rarray ( sequence R_26resProperty )

R_26resProperty

sequence.

 
 s= Sequence("TTCCPSIVARSNFNVCRLPGTPEAICATYTGCIIIPGATCPGDYAN") # crambin sequence 
# 26-dim. hydrophobicity vector for A,B,C,D,E,F,.. 
 h={1.8,0.,2.5,-3.5,-3.5,2.8,-.4,-3.2,4.5,0.,-3.9,3.8,1.9,-3.5,.0,-1.6,-3.5,-4.5,-.8,-.7,0.,4.2,-.9,0.,-1.3,0.} 
 hs=Rarray(s,h)              # h-array for each sequence position 
 hh = Smooth(Rarray(s,h), 5) # window average

Calculating array of alignment strength values for each column

Rarray ( ali [ simple|exact ] )

n*(n-1)/2

exact

comparison matrix

simple

d

n(n-1)/2

The values returned with both simple option and the default are therefore between 1/n (all residues are different) and 1.
To project the resulting array to a specific sequence, use the Rarray( R_ ali_ seq_ ) function (see above).
To calculate conservation with respect to a particular set of residues in a structure, use the Score( rs_ [ simple ] ) function.

To project the conservation onto a 3d chain with its linked sequence in alignment use the Rarray( R_conserv alignment seq3d) projection function. e.g.
Example:

 
read alignment s_icmhome+"sh3" 
show Rarray(sh3) 
# 
a=Rarray(sh3 simple)  # a number for each alignment position 
# to project a to a particular sequence, do the following 
b=Rarray(a,sh3,Spec)  # a number for each Spec residue 
String(Rarray(a, sh3, Spec ))//String(Spec)   # example

See also:

Score( ali [simple])
Entropy( ali [simple|info])

Real function

Real ( integer )

real

Real ( string )

real

 
 s = "5.3"         
 a = Real(s)      # a = 5.3  
 s = "5.3abc"     # will ignore 'abc'    
 a = Real(s)      # the same, a = 5.3

Toreal

Remainder function.

Returns the remainder; similar to, but different from the Mod function.

function description example
Remainder(x,y) brings x to [-y/2 , y/2] range Remainder(17.,10.) → -3.
Mod(x,y) brings x to [0 , y] range Mod(17.,10.) → 7.

Remainder ( i_divisor, [ i_divider ] ) - returns the integer
Remainder ( r_divisor, [ r_divider ] ) - returns the real remainder r = x - n*y where n is the integer nearest the exact value of x/y; if | n-x/y|=0.5 then n is even. r belongs to [ -|y|/2, |y|/2 ] range
Remainder ( iarray, [ i_divider ] ) - returns the iarray of remainders (see the previous definition).
Remainder ( rarray, [ r_divider ] ) - returns the rarray of remainders.
The default divider is 360. (real) or 360 (integer) since we mostly deal with angles.
Examples:

 
 read object s_icmhome+"crn.ob"
              # transform angle to the standard  
              # [-180., 180.] range. (Period=360 is implied)  
 phi=Remainder(Value(v_//phi))
 
              # we assume that you have two objects 
              # with different conf. of the same molecule

Reference

Reference( seq [ s_fieldName ] )

- returns the swissprot database reference if available. It is possible to specify the requested field name; the default is "DR".

Replace

[ Replace exact | Replace simple | Replace regexp | Chemical replace ]

Replace( s|S s_regexp s_by regexp [i_field=0] ) → s|S (regular expressions, and case-sensitivity, - see below).
Replace ( s_source s_icmWildcard s_replacement ) - returns a string, which is a copy of the source string with globally substituted substrings matching s_icmWildcard by the replacement string s_replacement.
Example:

 
 a=Replace(" 1crn "," ","")  # remove empty space

Replace ( s_source S_fromArray S_toArray )

Sequence

seq_

reverse

 
 invertedSeq = String(0,1,"GTAAAGGGGTTTTCC")  # result: CCTTT.. 
 complSeq=Replace(invertedSeq,{"A","C","G","T"},{"T","G","C","A"}) 
# result: GGAAA...

Replace ( s_source S_fromArray s_replacement )

s_replacement

 
 cleanStr=Replace("XXTEXTYYTEXT",{"XX","YY"},"")

Replace ( S s_icmWildcard s_replacement )

 
 aa={"Terra" "Tera" "Teera" "Ttera"} 
 show column aa Replace(aa "er?" "ERR") Replace(aa "*[tT]" "Shm")

Replace ( S S_fromArray S_toArray ) or

Replace ( S k_translation_name-val_collection [s_nonMatchFormat]))

- returns a sarray with multiple substitutions.
Replace ( S s_icmWildcard s_replacement ) - returns a sarray with multiple substitutions to a single string.

Replace a matching element of an array with another string.

Replace( S s_completeString s_by exact ) → S

Search a string array and find an element which matches the full s_completeString, e.g. the "never again" element of S will only be matched with the "never againg" string, but not with "never" .

A straight forward substitution. Replace without any interpretation of the search string.

Replace( s|S s_whatAsIs s_byAsIs simple ) → s_|S

In this case there is not intepretation of the query string. The first occurrence of it is replaced with the second argument. Example:


s="a[b]()c"
Replace(s,"[b]","()$",simple) # no intepretation

Replace using regular expressions

Replace( s|S s_regexp s_by regexp [i_field = 0] ) → s|S

- replace the s_regexp in the source string or array by s_byRegexp using regular expressions. The latter is a string which may contain back-references.

Example: case-insensitve replacements:


  Replace("bla 1"//"Bla 2"//"BLA 3" , "(?i)bla ","") # get rid of bla

Example:


  read string "t.html"
  s_out = Replace( s_out, "(?n)<i>(.*?)</i>", "<b>\\1</b>" regexp )   # replace italic with bold
  s_out = Replace( s_out, " +", " ",regexp )   # replace multiple spaces with a single on

Note that "(?n)" modifier is needed to make '.' match newline too.

Dehtml-tagging of the html text in a string or a string array:Prep work in html conversion is usually this: <> S = Replace( S, "
","\n", exact) S = Replace( S, "

","\n\n", exact) S = Replace( S, " "," ", exact) # finally remove all tags S = Replace( S, "<.*?>","",regexp) <>

Example in which we remove href html tags from a column in a table :


read table html "http://pfam.sanger.ac.uk/search/keyword?query=sh2" name = "sh2t"
sh2t.ID = Replace(sh2t.ID, "<.*?>","",regexp)

Chemical replace

Replace( chem , s_smartFROM, s_smileTO [exact] )

Finds a chemical pattern containing one of several Rn groups and replaces the pattern to the s_smileTO pattern according to the matching R-groups. Note that all atoms except the ones connected to the R-groups in s_smartFROM pattern will only match exactly the same local pattern.

The molecules will be redrawn in 2D after the replacement. The exact option will supress the redrawing if the number of atoms in the FROM and TO patterns is the same.

Example in which we created a newe table tt with a modified column:


read table mol "drugs.sdf"
add column tt Replace(drugs.mol, "[R1]C(=O)O","[R1]C(=O)OC")

See also:

modify chemarray s_pattern s_repl [exact] will modify in place. This replacement can be done only for the "terminal" fragments (one attachment point)

`Trim-chemical{Trim} ( X [s_smart] ...) will iteratively delete selected atom pattern, e.g. "[*;D1]"

Res

residue selection function.
Res ( { os | ms | rs | as } [ append ] )

Res ( { rs [ append ] )
- selects residue(s) related to the specified objects ( os_), molecules ( ms_) or atoms ( as_), respectively. Option append extends the selection with the terminal residues (like Nter and Cter in peptides)
Examples:

 
 show Res( Sphere(a_1/1/* 4.) )   # show residues within 4 A  
                                  # vicinity from the firsts one

Atom

Mol

Obj

Res(ali ..): from sequence positions in sub-alignment to residue selection

Res ( ali { seq | i_sequence } )

residue

1crn_m

1

Resolution

Resolution ( )

real

Resolution ( os_object )

 
 sort object Resolution(a_*.) # resort objects by resolution
 res=Resolution(a_1crn.)[1]
 print "PDB structure 1crn: resolution = ", res, " A"

Resolution ( s_pdbFileName pdb )

real

Resolution ( T_factors [ R_6cell ] )

structure factor table

R_6cell

defCell

 
 read factor "igd"   # read h,k,l,fo table from a file 
 read pdb "1igd"     # cell is defined there 
 defCell = Cell(a_)  # extract the cell parameters from the object 
 group table append igd Resolution(igd) "res" 
 show igd

Ring

Ring( as ) - returns logical yes all atoms from the selection belong to one ring

Ring( vs ) - returns subset of input variable selection which belongs to one ring

Ring( chemical ) - returns chemical array of ring system(s)

Ring( chemical simple ) - returns chemical array of the smallest set of smallest rings (SSSR) Example:


show Smiles( Ring( Chemical("C(=CC=CC1C(=CC=C(C2C3)C=CC=3)C=2)C=1" ) ) unique )
show Smiles( Ring( Chemical("C(=CC=CC1C(=CC=C(C2C3)C=CC=3)C=2)C=1" ) simple ) unique )

Rfactor

crystallographic R-factor.
Rfactor ( T_factors ) - returns the real R-factor residual calculated from the factor-table elements T_factors.fo and T_factors.fc. Reflections marked with T_factors.free = 1 are ignored.

Rfree

Rfree ( T_factors )

real

factor-table

T_factors.fo

T_factors.fc.

T_factors.free

Rmsd

Rmsd ( { iarray | rarray | matrix | map } )

real

Rmsd ( Rn Wn )

- returns the real weighted rmsd and weighted mean as r_out according to this formula:


xw = Sum(w[i]*x[i])/Sum(w[i])  # the weighted mean
np    # is the number of non-zero weights
sdw2 = Sum(w[i]*(x[i]-xw)^2) / (((np-1)/np)*Sum(w[i]))
sdw  = Sqrt(sdw2)

Rmsd ( as_tetheredAtoms )

returns the real root-mean-square-deviation of selected atoms

tethered

after

Srmsd

as_

R_out

Rmsd ( ms_select1 ms_select2 chemical [output] )

- returns the real root-mean-square distance between two selected chemical (hetero) molecules after an optimal chemical superposition via graph-matching is performed. In this mode atom equivalence can be found either by substructure search or (if none of molecules is substructure of other) by common substructure search algorithm. Other feature of chemical mode is that it enumerates topologically equivalent atoms to find best superposition. The maximal common substructure will be used for the calculation. Option output will produce R_2out array with individual deviation for the matched pairs. Rmsd(R_2out) will essentially be the overall Rmsd, but one will be able to measure the maximal and median deviation as well.

See also Srmsd( ms1 ms2 chemical ) and superimpose command.

Rmsd ( chemarray ms_select2 [pharmacophore] )

- returns real array of root-mean-square distances between each element of chemarrayand ms_select2.

pharmacophore toggles pharmacophore superposition. ms_select2 - pharmacophore template.

Rmsd ( as_pharmTemplate as_select2 pharmacophore )

- returns the real root-mean-square distance between pharmacophore points of as_pharmTemplate and as_select2after an optimal superposition of as_pharmTemplate

Rmsd ( as_select1 as_select2 [ { { ali|align } | exact } ] )
- returns the real root-mean-square-distance between two aligned sets after these two sets are optimally superimposed using McLachlan's algorithm.
Virtual atoms. Be default, the first two virtual atoms ( vt1 and vt2 ) are automatically excluded from both selections unless the virtual option is explicitly specified.
The optional third argument defines how atom-atom alignment is established between two selections (which can actually be of any level atom selection `as_ , residue selection `rs_ , molecular selection `ms_ , or object selection `os_ , see alignment options). Number of equivalent atom pairs is saved in i_out . Two output selections as_out and as2_out contain corresponding sets of equivalent atoms. This function also returns the transformation in the R_out array.
See also: superimpose and Srmsd ().
Examples:

 
 read pdb "1mbn"                          # load myoglobin  
 read pdb "1pbx"                          # load alpha and beta 
                                          # subunits of hemoglobin  
 print Rmsd(a_1.1 a_2.1 align)            # myo- versus alpha subunit 
                                          # of hemo- all atoms 
 print Rmsd(a_1.1//ca a_2.1//ca align)    # myo- versus alpha subunit 
                                          # of hemo- Ca-atoms 
 
 print Rmsd(a_1./4,29/ca a_2.1/2,102/cb exact) # exact match

Rot

Rot ( R_12transformVector )

transformation vector

Rot ( R_axis , r_Angle )

R_axis

r_Angle.

Axis

r_out

 
         #  rotate molecule by 30 deg. around z-axis  
 rotate a_* Rot({0. 0. 1.},30.)

Rot ( R_3pivotPoint R_3axis , r_Angle )

transformation vector

R_axis

r_Angle

R_3pivotPoint

 
# rotate by 30 deg. around {0.,1.,0.} axis through the center of mass 
 nice "1crn" 
 R_pivot = Mean(Xyz(a_//*)) 
 transform a_* Rot(R_pivot,{0. 1. 0.}, 30.)

Sarray

[ Sarray index ]

Sarray ( integer )

Sarray ( integer s_Value )

s_Value

Sarray ( integer S_ids ) - returns sarray of unique strings using the ID seeds. Examples:


Sarray(10,Sarray(0)) # ID1 ID2 etc.
Sarray(10,{"A"})
Sarray(10,{"A","B","C"})
#
read csv header name='b' s_icmhome + "bnames.csv" # 2K baby names
Sarray(10000,Shuffle(Unique(b.name,sort)))  # useful for generating identifiers in tables

Sarray ( s_wildCard directory [simple|all] ) - returns sarray of file names with full path to them. With 'simple' option only file names are stored in the result array. all toggles recursive search in sub-folders.

Example:


  Sarray( "*.pdb" directory )
  Sarray( "/home/user/*.ent*" directory )
  if (Nof( Sarray("*.png") )==0) print "No images found"

Sarray ( string ) - converts the input string into a ONE-dimensional sarray . To split a string into individual lines, or to split a string into a sarray of characters, use the Split() function.
Sarray ( iarray|rarray|sarray ) - converts input arrays into an sarray.

Sarray ( sarray [32] hash ) - generates 32 char or 26 char MD5 based has string. Example in which we create a unique chemical id:


  add column t Chemical({"C1CCCC1","CCO"})
  add column t Sarray(Smiles( t.mol unique cistrans ), 32 hash)

Sarray ( rs [ { append | name | residue } ])

l_showResCodeInSelection

{"a_a.b/2:5", "a_a.b/10:15",..}

residue

append : will merge residue ranges
name : will return a string array of residueName residueNumber records
residue : will return an array of selection strings a_obj.mol/residueNumber records

 
Sarray(a_/2,4:5 name) 
 #>S string_array 
 def.a1/ala2 
 def.a1/trp4 
 def.a1/glu5 
Sarray(a_/2,4:5 residue) 
 #>S string_array 
 def.a1/2 
 def.a1/4 
 def.a1/5 
Field(Sarray(a_/2,4:5 name),2,"/") # extract residues 
#>S string_array 
 ala2 
 trp4 
 glu5

one-letter

a_/^F23

a_/23

Name( rs full ) : preferred way to generate arrays of residue selection strings
String( rs_ ) which returns one string;
Label( rs_ | as_ ) which will format the output string according to the resLabelStyle or atomLabelStyle preference.
l_showResCodeInSelection

Sarray ( stack, vs_var )

stack

'B': -200 < phi < -80 , 140 < psi < 200
'A': -101 < phi < -24 , -81 < psi < 4
'g': -169 < phi < -15 , -64 < psi < 54
'd': -211 < phi < -5 , 8 < psi < 136
'L': 24 < phi < 101 , -4 < psi < 81
'_': the rest

'M': -120 <= xi1 < 0
'P': 0 <= xi1 < 120
'T': 120 <= xi1 < 240

 
 show Sarray(stack,v_/2:10/x*)      # coding of side-chain conformations 
 show Sarray(stack,v_//phi,psi)     # backbone conformation character coding 
 show Sarray(stack,v_/2:10/phi,PSI) # character coding of a chain fragment

PSI

 
 ss=Sarray(5)       # create empty sarray of 5 elements  
 ss[2]="thoughts"   # assign string to the second element of the sarray  
 
 sa=Sarray("the first element") 
 
 show Sarray(Count(1 100)) # string array of numbers from 1 to 100

Sarray (sarray reverse )

 
Sarray({"one","two"} reverse)  # returns {"two","one"}

Iarray

reverse

reverse

String

Sarray ( sarray i_from i_to )

i_from

i_to

i_from

i_to

 
a={"123","12345"} 
Sarray(a,2,3) 
 {"23","23"} 
Sarray(a,5,2) 
 {"32","5432"}

Getting selected entries from index table

Sarray ( T_index )


read index "myindex"
S = myindex[2:8]
S[1]

write index

Score


 Score( <R_X> <R_Y> ) => r [-1.:1.] # overlap between two distributions
 Score( <R_Ei> <R_Di> <wE> <wD> ) => <r> # prediction quality
 Score( <I_keys1> <I_keys2> <nBits>|<R_bitWeights> [simple] ) => M # Tanimoto distances
 Score( <seq1> <seq2> [new|nucleotide|simple] ) => <R_scores>
 Score( <seqArray[n]> ) => Mnn_alignedScore
 Score( <i_len> <r_probability> [comp_matrix|similarity|identity] ) => <r_>
 Score( <ali2> [area|sort|comp_matrix|similarity|identity  [<i_alnLength>] ] ) => r # see also Distance (<ali>), and Rarray(<ali>) 
 Score( <X_n> <X_m> [[<R_Wn> <R_Wm>] <r_minScore> (0.4) [<r_steepness>(6.)]] set ) => r_inter_set_score [0:1]
 Score( <X_3D_n> [<X_3D_m>] field|similarity|distance ) => <M_nxm APF_scores> # needs Chemical(<as> exact hydrogen)
 Score( <X_3D_n> torsion ) => <R_Free_Strain> # 
 Score( <rs_n> [simple|info|comp_matrix] ) => <R_n_conserv_scores> # info is entropy
 Score( <rs_n> <seq_n> ) => <r_score_without_alignment>
 Score( <rs_n> <seq_n> all ) => <T_sel_scores_seqids>
 Score( <model> full|test [<s_stats>] ) => <r_learnStatistics>
 Score( predict <R|I_obs> <R|I_pred> [<R_weights>] ) => <R_allRegression_or_Classification_Stats>

A measure of overlap between two arrays

Score ( R_1, R_2 ) → r_overlapMeasure

- returns the real measure of overlap between two real arrays. This measure varies between -1 and 1.. (all values of R_1 are smaller than all values of R_2) and +1. (all values of R_1 are greater than all values of R_2) and may serve as a ranking criterion.
Examples:

 
 show Score({1. 2. 5. 3.} {3. 1.5 1.5 5.})  # 0. perfectly overlapping arrays 
 show Score({2. 5. 3.} {1. 1.5 0.5})        # 1. no overlap R_1 > R_2 
 show Score({1. 1.5 0.5} {2. 5. 3.})        # -1. no overlap R_2 > R_1 
 show 1.-Abs(Score({1. 3. 2.5} {2. 5. 3.})) # relative overlap between R1 and R2

Similarity score between two sets of ligands

Score( X_n X_m [[R_Wn R_Wm] r_minScore (0.3) [r_steepness(6.)]] set ) → r_sim_score [0:1]

This function returns a similarity (0. to 1.) between two sets (arrays) of chemicals. It is calculated as N_AB /( N_AA + N_BB - N_AB ), where N is an effective number is similar compounds calculated as weighted sum of sigmoidly transformed similarities to the power of one half. The original similarity measure is transformed by a sigmoid starting from r_minScore (0.) and ending at 1. The mid point of the sigmoid is at 0.5*(1+ r_minScore ) . A general form of the sigmoid before it is shifted and squeezed is 1./(1.+exp(-b*t)) where b is r_steepness . Arguments:

X_n , X_m : two chemical arrays of size n and m
W_n , W_m : weights assigned to the chemicals
r_minScore> (0.3) , first non-zero value of the shifted sigmoid.
r_steepness (6.) ; steepness of 0. corresponds to the linear function between 0. and 1.

r_out : N₁₁
r_2out : N₂₂
r_3out : N₁₂


 Score( t.mol tt.mol set )

Free Strain for 3D molecules

Score( X_3Dn torsion ) → R_n_free_strain

returns free strain(s) for a set of 3D molecules

Calculated as:

= -Sum(Log(Probability_Tors[i])

where

Probability_Tors[i] = Sum( prof[k]*Exp(-d*d) ) k : 1:36 # 36 bins d : current_torsion_bin-k # distance in bins between current torsion angle bin and k

The result includes entropy component : if there are three equal depth minima, even at the bottom of any of them there will be an offset penalty corresponding to entropy loss associated with locking out two of the three minima.

APF similarity between superimposed ligands in 3D

Score( X_3Dn [X_3Dm] field|similarity|distance ) → M_nxm apf_scores

returns all pairwise APF scores between 3D chemical arrays. Two prerequisities:

a superimposed set of ligands in a chemical 3D array
and APF parameters loaded with the
```
read pmf s_icmhome + "APF"
```

Chemical

exact

field : returns raw (negative for similar atom pairs) pairwise APF similarity scores Sij
similarity : returns normalized APF similarity calculated as S_ij_sim = |Sij|/Sqrt(|Sii*Sij|). (we change the signs on Sij values to make it positive if necessary). The similarities are 1. for self-comparison and numbers less for different molecules.
distance : returns chemical distance calculated as 1. - normalized_similarity .

Example:


build string "H"
build string "W"
build string "A"
add column t Chemical(a_*. exact hydrogen ) name="mol"
read pmf s_icmhome+"APF"
show Score(a_1. a_2. field)
sf = Score( t.mol field )
ss = Score( t.mol similarity )
sd = Score( t.mol distance )
CS = Rarray( ss 6 )  # off diagonal elements 
Mean(CS)             # average similarity

If the two sites (or atoms sets) are not superimposed, use the siteSuperAPFas1 as2 exact macro which makes the superposition and returns the un-normalized score. The normalized score can be returned after converting the superimposed selections into a 3D chemical array with the Chemical( as exact ) function Alternatively, the normalization can be done directly by the above formula ( S_ij = |Sij|/Sqrt(|Sii*Sij|) if self-scores are calculated.

Quality measures of a regression or classification model.

Score ( model [ test | full ] s_stats ) → r_PredictionQuality

Categorical or class prediction (e.g. Bayesian classifier). If each data record has a label which can be either positive or negative (say, 1, or -1) then the success of a prediction method can be measured by the following measures:

measure abbreviation/formula description
"tpos" TP number of true positives
"fneg" FN number of false negatives
"fpos" FP number of false positivers
"tneg" TN number of true negatives
"accuracy" Q=(TP+TN)/(TP+TN+FP+FN) fraction of correctly predicted label assignments
"sensitivity" TP/(TP+FN) fraction of correction predicted positive labels
"specificity" TN/(TN+FP) fraction of correction predicted negative labels
"mathews" (TP*TN-FN*FP)/Sqrt((TP+FN)(TP+FP)(TN+FN)(TN+FP)) Mathews correlation
"precision" PR=TP/(TP+FP)
"recall" RE=TP/(TN+FP) same as sensitivity
"f1" 2*PR*RE/(PR+RE)
Note that only "acuracy", and "mathews" are overall measures symmetrical with respect to the label. Frequently a method is characterized by an area under a recall - precision curve.

Quality measures to evaluate a regression method predicting numerical values, e.g. Partial Least Squares, or Kernel Regression.

measure formula description
"r2" r=Mean((X-))/(Stdev(X)*Stdev(Y)) ; r2=r*r correlation squared
"rmse" Sqrt(Sum((Xpred-Xobs)^2 )/N) root-mean-squared error
"expavg"

Scoring prediction quality from an array of errors and predicted scores

Score ( R_En, R_Dn, wE, wD ) → r_PredictionQuality
Evaluates the quality of submitted multiple predictions for a unknown outcome. The submitted provies R_En , the evaluator evaluates R_Dn from the correct answer, then plugs in the weights and calculates the quality. The Q (quality)-value of predicted "energies" R_En for n - states, by comparing predicted energies with the deviations R_Dn from the correct answer. In essence we are doing the following:

starting from distances (e.g. RMSD) from the correct answer Di (Di >=0.)
calculating their well-behaved and inverted version, exp( -w*D ) [0., 1.]
calculating the normalized Boltzmann average if the previous similarity measure
taking -Log of the previous average

Q = - Log( Sum( exp(- wE#(Ei-Emin) -wD*Di )) / Sum(exp(- wE#(Ei-Emin))) )

Aligning and scoring pairwise sequence alignment

Score ( sequence1, sequence2 )
- returns the real score of the Needleman and Wunsch alignment.
Each pair of aligned residues contributes according to the current residue comparison table, which is normalized so that the average diagonal element is 1. Insertions and deletions reduce the score according to the gapOpen and gapExtension parameters. Approximately, the score is equal to the number of residue identities.
To calculate an array of mean scores for each column of a multiple sequence alignments use the Rarray( ali [ exact ] ) function. i_out returns the number of identical residues.
Examples:

 
 read sequence msf s_icmhome+"azurins.msf"
 a = Score( Azur_Alcde Azur_Alcfa ) # it is around 90.

Score ( seq_n_long, seq_m_short simple ) → R_n-m+1_scores

returns an array of scores of sliding no-gap sequences.

Score ( seq_n, rs_N ) → r_no_gap_score

Score ( seq_n, rs_N all ) → T_N-n_scores_ids_for_all_frames

these two functions return the match or no-gap-alignment score for one frame or multiple frames with the all option.

The second function template returns a table with the following columns: i (relative number), nu (first residue number), sl (fragment selection string), se (the first residue code), sc (normalized alignment score divided by the sequence selfscore and multiplied by 100., id (sequence identity), sf (relative surface area), ss ( relative non-loopsecondary structure ). id , sf , ss range from 0. to 100. % . Make sure to assign the secondary structure and calculate the atomic surface areas before you fun the Score(.. all ) function. Example:


build string "ASDFY"
a=Sequence("SDF")
assign sstructure a_/A
show surface area 
t = Score( a_/A a all)
show t

See also: Distance( ).

Conservation score per residue from an alignment projected to a structure.

Score ( rs, [ simple | comp_matrix | info ] )

Setup: a multiple sequence alignment, one of the sequences is linked to a structure, you may want to color residues by conservation or other measure of a column in an alignment. For a straight conservation value for each position in an alignment see Rarray( ali ) The function returns the rarray of alignment-derived conservation values for the selected residues. For each residue Ri in the residue selection rs_ the following steps are taken:

a column is extracted from a linked N-sequence alignment ( see the link command )
Si = Sum( Cij )/N where j=1,..N and Cij is the residue comparison value
simple mode: Cij = 1. for two identical residues and 0. otherwise . It is also the default.
comp_matrix (also the default) mode: Cij is taken from a normalized comp_matrix . Its elements are calculated as Cij_norm = Cij/Sqrt(Cii*Cjj) with the negative elements set to zero. The sum, d, of normalized positive similarity values c_ij is then converted to the conservation value of (1/2·n_seq) √(8d+1) .
info (inverse information entropy mode): Cij = 1./(1.+Entropy), where Entropy= -Sum((f_{j)Log(f_j))}. The elements after normalization range from 0. (no conservation, 1. conservation).

 
  read alignment s_icmhome+"sh3.ali" 
  read pdb "1fyn" 
  make sequence a_a 
  group sequence sh3 
  align sh3 
  display ribbon 
  color ribbon a_a/A Score( a_a/A simple ) 
  show surface area 
  show Mean( Score( Acc(a_a/*) ) )    # conservation score for the surface 
  show Mean( Score( a_a & !Acc(a_/*)))# conservation score for the buried

See also: Rarray( ali [simple] )

Scoring an existing alignment, or calculating sequence identity.

Score ( ali2, [ { identity | similarity | comp_matrix | sort } [i_alnLength] )
- returns the real score of the given pairwise alignment calculated by different methods. By default the score (or number of identical pairs) are divided by the minimal length of the two sequences. With the third i_alnLength argument, it will be divided the this argument. If the 3rd argument is zero, the score/identity will be divided by the length of the alignment. For a straitforward alignment conservation see: Rarray( ali [simple] ) Options:

no second argument : the straight Needleman and Wunsch score: aligned residues score according to the residue comparison table, gaps according to the gapOpen and gapExtension parameters.
identity the number of identical residues in the alignment divided by the smallest sequence length and multiplied by 100 %.
comp_matrix the alignment score without the gap component. It contains only the total score of the aligned residues calculated from the residue comparison table and does not include penalty term.
similarity the alignment score without the gap component multiplied by 100. and divided by the smallest sequence length.
sort a score occasionally used for ranking/sorting the alignments in fold recognition. Currently it is equal to the comp_matrix_score - 1.3*totalGapPenalty

To extract a pairwise alignment of sequences 1 and 2 from a multiple alignment use the Align( ali I_seqIndexes ) function, e.g.


make sequence 5 20  # 5 random sequences of length 20
align sequence   # creates aln
Score( Align(aln, 1//2 ), identity)  # 1//2 results in an iarray {1,2}

To return a matrix of all pairwise seq. identities, use this:


n = Nof(sequence) 
mdist = (Matrix(n,n,1.) - Distance(Parray(sequence ) )) * 100. # 100 for identities

Score ( i_minLen, r_Probability [, { identity | similarity | sort } ] )

real

r_Probability

above

 
 Score( 150, 1./55000.,identity)

Probability

Select

Select() → as_graph_or_displayed_or_current

Select(as_source cond ) → as # conditions: "c" from x,y,z,b,o,c,f,a,u,v,w,n e.g. Select(a_ "b>80")

Select(os|ms|rs|as s_expr|fieldName|cond ) → o|m|r|as # 'n' number_of, 'r' resolution

Select(os|ms "biomt" i_Biomol ) → ms_of_biomol_i # molecules selected in i_Biomol

Select(as_source I_indices ) → as

Select(as_source hydrogen|hbond|smooth ) → as_expandByTerminalAtoms

Select(as_source fix|unfix) → as_atomsOn(un)FixedBranches

Select(as_source bond nNeighbors ) → as_atomsWithN_neighbors

Select(as_source molecule i_mol ) → as_atoms_of_ith_disconnected_fragment

Select(as grob ) → as_subset_in_grob_vicinity

Select(grob ) → as_currentObjAtomsNearGrob

Select(rs_patches i_MaxGapSize ) → rs_patchSmallGaps # e.g. Select(a_/1:2,4:6 2) → a_/1:6

Select(rs_patches margin,i_size ) → rs_expand_by_margin

Select(as error ) → as_flankingBackboneBreak

Select(as_inObjA os_objB ) → as_inObjB

Select(seq [ ms_where [r_min_seqid(0.2) [r_mx_length_dist(0.3)]]] ) → ms_sim_seq

Select atoms forming an abnormally long covalent bond

Select( as delete | error ) → as_bad_atom_pairs

this function returns pairs of atoms (i) connected with abnormal bond lengths, and, (ii) breaks in the backbone of a polypeptide (even if a 'C' carbon and the following 'N' are not bound covalently).

Example in which we find residues flanking the missing loop:


read pdb "2pe0" 
display ribbon
display residue label Res(Select(a_ delete ))

Select atoms on rotatable (or non-rotatable) branches.

Select ( as fix | unfix ) select atoms of the fixed or rotatable branches, for the fix or unfix options, respectively.

Select atoms with a certain number of covalently bonded neighbors.

Select( as bond i_NofBondedAtoms )

- returns a sub-selection of as with atoms having the specified number of covalent neighbors.

Example:


build smiles "CCO"
show Select(a_ bond 1) # selects all terminal hydrogens
show Select(a_ bond 2) # selects oxygen that is bonded to C and H
show Select(a_ bond 3) # no atom has three neighbors: 
show Select(a_ bond 4) # carbons have 4 neighbors

Select by number of sub-elements.

These functions allow one to select objects according to the number of molecules in them, and molecules according to the number of residues in them.

Select ( os "n==nofMolecules" )
selects objects by number of molecules in them
Select ( ms "n==nofResidues" )
selects molecules by number of residues in them

Allowed comparison operations : ==, >, >=, <, <=, != .

Example:


Select( a_A,N "n==1" ) # all single residue amino or nucl molecules 
Select( a_A,N "n>1" ) # all longer residue amino or nucl molecules

Select displayed or explicitly selected atoms

Select ( [ residue | molecule | object ] ) - returns either selected ( as_graph ) or displayed atoms ( a_*.//DD ). By providing the argument, you can change the selection level. Example:

 
  display skin Select(residue)

Expand selection to the bonded terminal atoms: hydrogens, polar hydrogens or all.

Select ( as_source [ hbond | hydrogen | smooth ] ) - returns the source selection expanded to single covalently bonded atoms, e.g. hydrogens. The returned selection is at the atomic level. Options:

hydrogen : adds all attached hydrogens
hbond : adds all polar (non carbon connected, sorry, no aromatic hbonds) hydrogens
smooth : adds all bonded terminal atoms (hydrogens or heavy atoms).

 
  Select(a_/tyr/o* hbond ) # adds hh to this selection
  Select(a_/tyr/cb hydrogen ) # adds hb1 and hb2
  Select(a_//ca,c,n smooth ) # carbonyl oxygen and N-terminal hydrogens

Select by coordinates, bfactor, occupancy or by user field

Select ( as s_condition [ r_Value] ) - returns a sub-selection of atom selection as_ according to the specified condition s_condition.

Select ( os|ms|rs s_condition ) - can use field names (see set field sel name=.. ) or presets 'n' for number of molecules in an object or number of residues in a molecule. Also 'r' for resolution (eg 'r<2.3' )
Three example conditions:
"X >= 2.0" , "Bfactor != 25." , "charge == 0." . Allowed properties and their aliases (case does not matter, the first character is sufficient) are as follows:

x,y,z atomic coordinates ("x","y","z")
bfactor ("bfactor","b","B")
occupancy ("o")
charge ("charge","c","q")
formal charge ("","f")
accessible surface area ("area","a")
user-field ("u") which can be set with set field and extracted with the Field function.
residue user-fields: "u","v","w" for the 1st, 2nd and 3rd field, respectively.
number of sub-components, number of molecules in one object, or number of residues in one molecule, see above.

show area

== != > < >= <=

r_Value.

Area

Bfactor

Xyz

Charge

Field

 
 build string "se glu arg" 
 show Select(a_//* "charge < 0.")|Select(a_//c* "x> -2.4") 
 show Select(a_//c* , "x>", -2.4) 
 
 show Select(a_/* , "w>3.") # 3rd res. user field greater than 3.

show a_//* & {-1.,10.,2.,25.,30.,22.}

a_//* & Box( ).

display box

Box

Select an equivalent selection in a different object

Select ( as_sourceSelection os_targetObject )
- returns the source selection as_sourceSelection from a source object which is transferred to another object ( os_targetObject ). The two objects must be identical in content. Example:

 
 build string "ASD" 
 aa = a_/2/c*          # selection in the current obj a_ 
 copy a_ "b"           # a copy of the source object 
 bb = Select(aa,a_b.)  # selection aa moved to a_b.

Select residues by a string array of selection expressions

Select ( os_sourceObject S_residueSelStrings )

- returns residue selection of the residue selection strings which can be returned with the Sarray ( rs_ residue ) function. The object name can be skipped. E.g.


 Select( a_2ins. {"a/14","b/14"} )

Select residues by a string array of selection expressions

Select ( os_sourceObject I_atomNumbers )
- returns atom selection of relative atom numbers in specified object os_sourceObject. The iarray can be generated with the Iarray ( as_ ) function. This function allows one to pass selections between ICM sessions.

Add small unselected residue ranges to a fragmented residue selection

Select ( rs_fragmented_selection [smooth|margin] i_gapSizeToHeal )

This function by default (or with option smooth) will take a source residue selection, identify all gaps of size below the specified parameter and will healing those gaps by adding them to the selection.

For example, if you have a residue selection, e.g. a_/1,2,5,6


Select(a_/1,2,5,6   2 ) # residues 3 and 4 will be added 
 a_/1,2,3,4,5,6

Option margin will simply expand the source selection by the specified margin size.

Select atoms interacting with a given selection

Select ( as_source "vw,14,hb,el,cn,tz" )

Interacting atoms.selecting atoms interacting with the source atoms according to a particular energy term. It is required that the source atoms are in the current ICM object and show energy command has been used at list once. See example below.

Tether destination atoms.In case of tethers ("tz") this function returns a selection of the static destination atoms (same as a_//Z ). Example:


build string "se ala his trp"
copy a_ "tz" tether # make a copy object and tether atoms to a_tz.
show energy  "vw,hb"
aca = a_//ca # selection of Ca atoms
Select( aca "vw,hb" )
Obj(Select( aca "tz" ))
  2  a_tz.  Type: ICM       Mol: 1     Res: 1     def

Select by alignment distance and sequence length mismatch

Select(seq [ ms_where [r_min_seqid(0.2) [r_mx_length_dist(0.5)]]] ) → ms_sim_seq

- returns molecular selection of all chains with sequences similar to seq . Arguments and options:

seq : a shell sequence or a dynamically generated sequence, e.g. Sequence(a_1.a)
ms_where : a subset of molecules for the sequence search. The default value is all molecules of all objects
r_min_seqid : is sequence identity threshold (0. to 1.) 1. for identical sequences, and 0. unrelated sequences.
r_mx_length_dist :the length distance is defined as the length_difference / longer_sequence_length and can only be a number between 0. and 1. (0. identical lengths, 1. no length restrictions). To ignore length filter, specify 1. or a larger number.


read pdb "1crn"
read pdb "2ins"
Select( Sequence( a_2.2 ) )
display ribbon a_
color ribbon magenta Select( Sequence( a_2.2 ) a_*. 0.2 0.3 )

Select by alignment

Select( rs_|as_ alig ) → selection_propagated_by_ali

Selecting residues by centers of mass

To select the closest residue from a center of mass of one selected residue, use the Sphere function with a coordinate matrix argument. We need to follow these steps:

Function Xyz(as_res) will return a set of coordinates (as a Nx3 matrix) for the selected residue atoms. Use as_res & a_*.//!h* for heavy atoms only.
Function Mean(Xyz(as_res)) to get a single vector with the center of mass
Use Sphere( Mean(Xyz(as_res)) as_targetSelection & ! as_res ) to select atoms at a certain distance from the center of mass.

Xyz(

residue

E.g.


  read pdb "1crn"
  display
  center_of_mass = Mean(Xyz(a_/44))
  display xstick magenta Res(Sphere( center_of_mass , a_1. & a_/!44 , 7.5) )  
# Res is added to select all residue atoms once an atom is inside the sphere

To find the closest residue to residue 44 in the above example, use the table approach, e.g.


read pdb "1crn"
display
center_of_mass = Mean(Xyz(a_/44))
nb = Res(Sphere( center_of_mass , a_1crn. & a_1crn./!44 , 7.5) )
if(Nof(nb)>0) then
 group table t Rarray(0) "dist" Sarray(0) "sel"
 for j=1,Nof(nb)
   add t
   cmj = Mean(Xyz(nb[j]))
   t.dist[j] = Distance( center_of_mass, cmj )
   t.sel[j] = String(nb[j])
 endfor
 sort t.dist  # the smallest distance is on top ([1]) now
 s_closest_res = t.sel[1]
endif

A faster implementation of the same task with the Group function with the "mean" argument. This solution can also be modified to use the closest atom (instead of the center of mass) by using "min".

Sequence

[ Dna to rna conversion | Reverse complement | Sequence array ]

Sequence ( as_select )

sequence

Sequence( s [ nucleotide | protein ] )

sequence

"protein".

set type

seq

nucleotide

protein

Examples:

 
 seqA = Sequence( a_1./15:89 )     # create sequence object  
                                   # with fragment 15:89  
 
 show Align(seq1, Sequence("HFGD--KLS AREWDDIPYQ")  
                                   # non-characters will be squeezed out 
 a=Sequence("ACTGGGA", nucleotide)  
 Type(a , 2)  # returns the type-string : 
 nucleotide

Sequence ( ali )

alignment

Sequence ( ali --group ) - returns a chimeric sequence which represents the strongest character in every alignment position.
Sequence ( profile ) - returns a chimeric sequence which represents the strongest character in every profile position.

convert DNA sequence to RNA, and back, replace T by U or U by T

Sequence ( seq_NAseq convert ) - converts DNA sequence to RNA sequence or the other way around, and returns a sequenced with T replaced by U or the other way around, depending on the source sequence.


dn = Sequence("AATTCCGG" nucleotide)  #create a DNA sequence
Type(dn, 2)
 dna
rn = Sequence( dn, convert ) # converting seq dn to RNA
Type(rn, 2) 
 rna
show dn, rn
 >dn
 AATTCCGG
 >rn
 AAUUCCGG
back_to_dna = Sequence( rn , convert )

reverse complement dna sequence function

Sequence ( seq_NAsequence reverse ) - returns the reverse complement DNA of RNA sequence:

 
     nucleotide      |complement 
_____________________|__________ 
                     |   
 A = Adenosine       | T (replace by U for RNA) 
 C = Cytidine        | G 
 G = Guanosine       | C 
 T = Thymidine       | A 
 U = Uridine         | A 
 R = puRine    (G A) | Y 
 Y = pYrimidine(T C) | R 
 K = Keto      (G T) | M 
 M = aMino     (A C) | K 
 S = Strong    (G C) | S 
 W = Weak      (A T) | W 
 B = !A      (G T C) | V 
 D = !C      (G A T) | H 
 H = !G      (A C T) | D 
 V = !T      (G C A) | B 
 N = aNy             | N

create sequence array

Sequence( S_sequenceString [S_seqNames] ) - converts sarray of sequence strings to protein-sequence parray

Sequence( S_namesOfLoadedSequences name ) - returns a parray of protein sequences retrieved by name from the ICM shell/workspace.

Example:


add column T Sequence( {"MILERR", "STAGKVIKCKAAVLW"} {"aa","bb"} ) name="seq"
add column T Length(T.seq) name="len"
set name T.seq {"seq1", "seq2"} # reset names
sq1 = T.seq[1]

read alignment s_icmhome + "sh3.ali"
add column t Sequence({"Spec","Fyn"} name)

Shuffle

Shuffle ( I | R | S ) → shuffled_array

Shuffle ( string ) → shuffled_string

Shuffle ( seq ) → shuffled_sequence

randomly change order of elements of an array or a sequence of characters.

Example:


a={1 2 3}
Shuffle(a) # {2 1 3}
Shuffle(a) # {3 1 2}
Shuffle("this") # won't tell you..

Sign

-1.|0.|+1.

Sign ( real )

real

Sign ( integer )

Sign ( iarray )

Sign ( rarray )

Sign ( map ) - returns map with -1., 0., 1. values.
Examples:

 
 Sign(-23)  
 -1 
 Sign(-23.3)  
 -1. 
 Sign({-23,13})  
 {-1,1} 
 Sign({-23.0,13.1})  
 {-1.,1.}

Sin

Sin ({ real | integer } )

real

Sin ( rarray )

rarray

 
 print Sin(90.)                  # equal to 1  
 print Sin(90)                   # the same  
 
 print Sin({-90., 0., 90.})      # returns {-1., 0., 1.}

Sinh

Sinh ( { real | integer } )

real

Sinh(x)=0.5( e^iz - e^-iz )

Sinh ( rarray )

rarray

 
 print Sinh(1.)                  # equal to 1.175201  
 print Sinh(1)                   # the same  
 print Sinh({-1., 0., 1.})       # returns {-1.175201, 0., 1.175201}

Site

site

Site ( s_siteID [ ms ])

 
 nice "1est"  # contains some sites  
 delete site a_1 Site("cat",a_1)

Slide

Slide( )

- returns a compact binary representation of the entire graphical view (also known in ICM as a slide). Slides include the following:

molecular representations, labels, sites and colors for multiple objects
grobs and their display attributes
the view point, zoom and scale
key graphics preferences
displacements of residue labels
the background color.

The data is packed into a single-element parray of a view type. These "slides" can be written and read as parts of the .icb project files with the read binary command.

To display the view use the display parray_slide command, e.g.


slide1 = slideshow.slides[1]
display slide1

Slide( gui )

- returns a slide containing only the current window layout information.

See also: String slide gui, add slide.

Smiles

convert chemical structure into a Smiles string.
Smiles ( as )

Smiles ( chem [unique] [cistrans|cartesian] )

- returns a smiles - string with the text representation of the chemical structure of a selected fragment or a chemical array.

The unique option will make that string independent of the order atoms in the molecule. The cartesian option will adds 2D or 3D coordinates at the end of the result smiles string. That coordinates will be used in Chemical function

Example:


read object s_icmhome + "biotin.ob"
s_smiles = Smiles( Chemical( a_ exact hydrogen  ) cartesian )  # coordinates will be preserved
read mol input=String( Chemical( s_smiles ))  # 
Srmsd( a_1. a_2. chemical )
#
# or even simpler
#
s_sm3d = "CCCCC|3D:1.39,-0.00,-0.02,2.16,1.30,-0.01,3.45,1.20,-0.82,4.23,2.52,-0.81,5.51,2.42,-1.62"
read mol input=String(Chemical( s_sm3d ) name="cc"

build smiles

String

as_

Smooth

[ Smooth | Smooth matrix | Smoothrs | Smooth alignment | Smooth map ]

Smooth

Smooth ( R_source, [ i_windowSize ] )

R_source

i_windowSize

windowSize

i_windowSize

i_windowSize=5

Smooth ( R_source, R_weightArray )

R_source

R_weightArray

i_windowSize

R_weightArray

R_source

 
 {-1.,1.}/Xstep             # for the first derivative 
 {1.,-2.,1.}/(Xstep*Xstep)  # for the second derivative 
 {-1.,3.,-3.,1.}/(Xstep*Xstep*Xstep)  # for the third derivative 
 #  ... etc.

 
 gauss=Exp( -Power(Rarray(31,-1.,1.) , 2) )   # N(0.,1.) distribution on a grid 
 x = Rarray(361,-180.,180.)                   # x-array grows from 0. to 180.  
 a = Sin(x) + Random(-0.1,0.1,361)            # noisy sine  
 
 b = Smooth(a,gauss)          # gauss averaging   
                              # see how noise and smooth signals look  
 plot x//x a//b display {-180.,180.,30.,10.} 
                              # take the first derivative of Sin(x)  
 c = Smooth(Sin(x),{-1., 1.}) * 180.0 / Pi  
                              # plot the derivative  
 plot x c display {"X","d(Sin(X))/dX","Derivative"}

Smooth two-dimensional averaging of matrix values

Smooth ( M_source, [ i_halfwindow (1)> [<r_radius (1.)>]] ) → <M

The values in the source matrix get transformed according to a Gaussian 2D transformation in which the values i,j get averaged with the values in the neighboring [i-n:i+n] [j-n:j+n] cells , (2n+1)^2 in total, according to the gaussian weights calculated as exp( r² / R² ), where R is the r_radius parameter, and n is the i_windowSize parameter. The default parameters are 1 for the i_halfwindow (corresponding to 9 cell averaging) and the radius of 1..

Examples:


  Smooth(Matrix(10),0) # keeps the matrix intact
  Smooth(Matrix(10),3,1.5) # weighted average with 7*7 surrounding values (7=3*2+1) for each cell.

Smooth: three-dimensional averaging of residue properties

Smooth ( rs, R_property, r_smoothRadius )

R_property

rs_

GRID.gcghExteriorPenalty

r_smoothRadius.

r_smoothRadius²).

set label

 
 nice "1tet"   # it is a macro displaying ribbon++ 
 R = Bfactor(a_/A ) # an array we will be 3D-averaging 
 color ribbon a_/A Smooth(a_/A R 1.)//5.//30.  # averaging with 1A radius 
 color ribbon a_/A Smooth(a_/A R 5.)//5.//30.  # with 5A radius 
 color ribbon a_/A Smooth(a_/A R 10.)//5.//30. # with 10A radius 
  # 5.//30. are appended for color scaling from 5. (blue) to 30.(red) 
  # rather than automated rescaling to the current range 
 
 set field a_/A Smooth(a_/A R 5.) 
 show Select( a_/A "u>30." ) # select residues with 1st field > 30.

Smooth: expanding alignment gaps

Smooth ( ali, [ i_gapExpansionSize ] )

alignment

i_gapExpansionSize

Smooth: transforming three-dimensional map functions.

Smooth ( map , [ "expand" ] )

weighted 3D-window averaging

Smooth( map )

map

Smooth

low-values propagation

Smooth( map "expand" )

map

 
 m_gc = Smooth(Smooth(m_gc "expand"), "expand" )

map

SolveQuadratic : roots of quadratic equation

SolveQuadratic( r_a r_b r_c | R_3|2 [all] ) → R_roots

returns an array of real roots of a quadratic equation ax² + bx + c = 0 By default only real roots are found. Option all : returns two complex numbers: {r1,i1,r2,i2} Example:


rts = SolveQuadratic(1. 2. 1.)  # one real root
show rts
 #>R 
 1.
Nof(rts)  # number of roots
 1

SolveQuadratic(1. 2. 3. all)  # two complex roots
 #>R 
  1.
  1.414214
  1.
 -1.414214

Sphere

selectSphereRadius

Sphere (as_source| grob|R_xyz|M_xyz [as_whereToSelect] [radius(5.)] ) → selection

this function returns a selection of atoms in a certain vicinity of the following set of points:

a group of atoms ( as_source )
any vertex point of a grob (i.e. graphical object)
a point in space ( R_xyz)
a group of points in space ( M_xyz) (e.g. see the Xyz function)

The atoms will be searched in the specified selection as_whereToSelect if the second selection is explicitly specified. If only one atom selection is specified, the atoms will be selected from the same object.
The function can be much accelerated if you specify the desired level of the resulting selection explicitly (e.g. molecule ,or residue, or object in the second function template). For example, if you just want to know molecules around a selection, you can say Sphere( a_1 a_2 6. molecule )

The selection level functions ( Res , Mol , and Obj ) can also be used to convert the atom selection into residues, molecules or objects, respectively (e.g. Res(Sphere(a_/15,4.)) ), if speed is not an issue or the explicit level option is not available.

For example, selection

 
 show Sphere( a_subA/14:15/ca,c,n,o , 5.2) 
 Res(Sphere( a_1.2 a_2.)) # residues of a_2. around ligand a_1.2 
 Sphere( a_1.2 a_2. 7. residue) # same but much faster

Adjusting for the van der Waals radii, vdW gap. Use negative distance values to indicate a different mode of the Sphere function. Sphere can also correct for the van der Waals radii if you specify the negative radius. Values < -1. indicate vdW gap ( -1.15 means 15% larger than the sum of vdW radii). In this case it is interpreted as a ratio of the inter-atomic distance to the sum of van der Waals radii. For example, Sphere( a_//a1 a_//a2 , -1.2 ) specifies the van der Waals gap of 1.2 , i.e. interatomic_distance / (R(a1) + R(a2)) will be compared with 1.2

The negative sign just flags the program to use the distance to vwlimit ratio instead of the distance. The value of -1.15 roughly corresponds to 3.5 . Example:


read pdb "2ins"
Sphere(a_1.1 a_1.2//!h*   3.4 )  # the traditional method
Sphere(a_1.1 a_1.2//!h* , -1.1)  # the corrected method

Sphere (M_xyz as_whereToSelect radius ring ) → as_ring_atoms

this function returns a selection of atoms in rings which centers are in a certain vicinity of the set of points in M_xyz.

Example:


read pdb "2l8h"
display xstick a_
make grob skin a_2l8h.a2 a_2l8h.a2 name="g_skin"
color g_skin white
color g_skin black distance Xyz( Sphere( Xyz(g_skin) a_N,A//RA 3.5 ring ) ring ) GROB.atomSphereRadius = 2.5
display solid smooth g_skin
printf " Info> %.2f%% of ligand surface is exposed to aromatic contact\n" (1-Mean(Mean(Color(g_skin))))*100

See also:

SoapMessage

function to form SOAP request or to parse a result from the server.

A SOAP message is special XML text which contains :

SOAP method name and a namespace
method arguments

SoapMessage( s_methodName s_methodNamespace ) returns soapMessage object with specified method name.

SoapMessage( soapMessage [ s_argumentName argumentValue ] ... ) adds a number of name/values pairs to the exiting soap message and returns a new soap message as a result

SoapMessage( s_xmlSource ) parses xml source and returns soapMessage object.

The following example form a SOAP request to the google search service.


# create a message with soap method 'doGoogleSearch'
req = SoapMessage( "doGoogleSearch","urn:GoogleSearch" )
# add method arguments
req = SoapMessage( req, "key","btnHoYxQFHKZvePMa/onfB2tXKBJisej" )  # get key from google
req = SoapMessage( req, "q", "molsoft" )  # search 'molsoft'
# some other mandatory arguments of 'doGoogleSearch'
req = SoapMessage( req, "start" 0, "maxResults" 10 )
req = SoapMessage( req, "filter", no, "restrict", "", "safeSearch", no )
req = SoapMessage( req, "lr", "", "ie" "latin1", "oe", "latin1" )

HTTP.postContentType = "text/xml"
read string "http://api.google.com/search/beta2" + " " + String(req)

# parse the result and check it for errors
res = SoapMessage( s_out )

if Error(res) != "" print "Soap error: ", Error(res)

See SOAP services for more information.

Sort

function to return the sorted version of array.
Sort ( sarray|iarray|rarray|chemarray [reverse] )

- returns the sorted array. Option reverse toggles the sorting order.
Examples:

 
 count_unique=Nof(Sort(Unique({1, 11, 7, 2, 2, 7, 11, 1, 7})))  # counts unique elements

Unique

sort

Split

[ Split tree | Split regexp | Split multisep | Split chemical ]

Split( s s_sepChars ) → S_words # to split into characters *

# "" to split into characters

Split( s s_sep exact|regexp ) → S_words

Split( S s_sepCols s_sepColNameValue [exact|regexp] ) → T_words # "A:1 B:2 C:3.3" into columns A,B,C, inverse to Sum(t,{"t.A","t.B"} " " ":")

Split( chem_1 [chiral|tautomer|mol|group] )→chem_multi # see also: enumerate ..

Split( table_n.cluster [r_threshold|i_nGroups] ) → I_n_groupIndices # needs: make cluster t

Split ( s_multiFieldString, s_Separators ) - returns sarray of parts of the input string separated by s_Separators.

Multiple spaces are treated as one space, while all other multiple separators lead to empty fields between them. If s_Separator is an empty string (""), the line will be split into individual characters. To split a multi-line string into individual lines, use Split( s_, "\n" ).
Examples:

 
 lines=Split("a 1 \n 2","\n")  # returns 2-array of {"a 1" " 2 "}   
 flds =Split("a b c")          # returns 3-array of {"a" "b" "c"}   
 flds =Split("a b:::c",":")    # returns 4-array of {"a b","","","c"}  
 resi =Split("ACDFTYRWAS","")  # splits into individual characters  
                               # {"A","C","D","F",...}

Field

Split ( s_multiFieldString s_separator exact ) - returns sarray of fields separated exactly by s_separator

Split tree cluster by threshold or number of clusters.

Split( table.cluster, [r_threshold]|[i_numberOfClusters] )

Returns iarray of cluster numbers for each row.

Example:


make tree t matrix "upgma" 
cl = Split( t.cluster, Max( t.cluster )/2 )

Split by a regular expression

Split ( s_source, s_separator, regexp )

- returns an sarray with the source string separated by regular expression

Useful separators:

'\s+' splits by single or multiple occurrence of spaces and tabs (brackets, pluses, quotes etc. remain)
'\W+' splits into words ('\W+' skips brackets, quotes etc. and matches the entire inter-word space)
'[\n\r]' (new line) splits a multi-line string into individual lines

Examples:



Split("a b \t\tc", "\s+", regexp) # returns 3-array of {"a" "b" "c"}

Split("a_asd_b_awe_c","_a.._", regexp) # returns { "a","b","c" }

Split by two separators

Split ( S_source, s_separator1, s_separator2 [regexp|exact] ) → T_table

- takes a sarray as an input. Each entry of sarray has the following syntax: namesep2valuesep1namesep2value ...

where name and value can be any text which does not contain sep1 or sep2

returns a table with columns name1, name2, etc. filled with corresponding values.

Example:


Split( { "a=1;b=2", "a=3;c=5" "d=1;b=1;e=7" } ";" "=" )

String

String() → s

String() → s_empty

String( as all|dot|sln|smiles ) → s

`String-date{String} ( date s_spec ) → s # see `Date

String( i|l|r [n_decimals] |pref ) → s_

String( blob_ [ 'base64'|'hex' ] ) # blob to string. see `Blob `read-blob

String( l_condition s_yes s_no ) # like this C expression l ? s1 : s2

String( alignment|macro|model|sequence ) → s

String( s i_from i_len ) | (fr to s) → s_substr

String( s key| hash ) (or 32 hash) → s_CRC32|s_MD5(see md5sum unix tool)

String( s n_repeats ) → s_repeated

String( slide gui ) → s_layoutString

String( table "tex"|"html" [header] ) → s_printTable

String( icm_word | unknown ) → s_

String( X_chemarray | table mol ) → s_sdfFileText

String( array | format ) → s_columnFormat

Detailed descriptions:
String ( seq ) - converts sequence into a string
String ( i ) - converts integer into a string . see Tostring
String ( r [ i_nOfDecimals] ) - converts real into a string . It also allows one to round a real number to a given number of digits after decimal point.
String ( s, i_NofRepeats ) - repeat specified string i_NofRepeats times
String ( string, all ) - adds flanking quotes and extra escape symbols to write this string in a form interpretable in shell in $string expression.

String ( string, html ) - return URL-encoded version of the input string argument. See also Table-urland Collection to parse URL encoded strings.

String ( collection, html ) - return URL-encoded query string from the input collection argument. See also Table-urland Collection to parse URL encoded strings.
String ( X_chem, html ) - return HTML5 canvas and rendering JavaScript

String ( w_img, html ) - return inline html image representation.

String ( s_input, s_default ) - if the input s_input string is empty returns the s_default, otherwise returns the s_input string
String ( string, i_offset, i_length ) - returns substring of length i_length. If i_length is negative returns substring from the offset to the end.
String ( { iarray | rarray | matrix } plot [ s_translateString ] ) - converts numbers into a string or ascii characters (the "Ascii art", i.e. 12345 -> "..:*#").
The range between the minimal and maximal values is equally divided into equal subranges for each character in the string. This function is useful for ascii visualization of arrays and matrices. The default translation string is ".:*0#". Another popular choice is "0123456789".
Examples:

 
 file=s_tempDir//String(Energy("ener"))   # tricky file name  
 show Index(String(seq),"AGST")              # use Index to find seq. pattern  
 tenX = String("X",10)                       # generate "XXXXXXXXXX"  
 
 show String(Random(1.,10.,30), plot ) 
 read matrix  
 show String(def," ..:*#")  # redefine the projection symbols

Tostring

show map

Extracting a substring or reversing the order of characters in a string

String ( i_from, i_to, string ) - returns substring starting from i_from and ending at i_to. If i_from is less than i_to the string is inverted. Zero value is automatically replaced by the string length, -1 is the last but one element etc.
Examples:

 
 String(1,3,"12345")  # returns substring "123" 
 String(4,2,"12345")  # returns substring "432" 
 String(1,0,"12345")  # returns "12345" 
 String(0,1,"12345")  # returns INVERTED string "54321" 
 String(-1,1,"12345") # returns "4321"

Date, time and their parts in string format

String( date s_format )

the format specifications are described in the Date function. Examples:


String( Date() "%A" ) # day of the week
String( Date() "%B" ) # month

See also : Date

Export a chemical or chemical array as sdf text.

String( X ) → s_sdfFile

generates a string buffer in mol/sdf file format. This can be used to read one or multiple chemicals from a table into 3D objects in ICM shell. Example:


group table t Chemical("CC=O")
read mol input=String(t.mol[1]) # creates 3D objects
# to write as a file use write table mol t

String( T mol ) → s_sdfFile

generates string buffer in mol/sdf format for the table or table selection. All table fields are included into the result

Example:


add column t Chemical({"Cc1ccc(C)c(c1)c1c(C=C2C(N(CC(O)=O)C(=S)S2)=O)cn(c2ccccc2)n1", "COc1cccc(C=C2C(N(CCC(O)=O)C(=S)S2)=O)c1OCC=C"})
add column t Predict(t.mol,"MolLogP") name="MolLogP"
String( t[1] mol )

Converting an alignment into text

String ( ali ) - converts the alignment into a multiline string . You can further split it into individual lines like "--NSGDG" with the Split(String(ali_)) command. The offset in a specific sequence and its number can be found as follows.
Examples:

 
 read alignment s_icmhome+"sh3" 
 offs=Mod(Indexx(String(sh3),"--NSGDG"),Length(sh3)+1) 
         # extract alignment into a string, (+1 to account for '\n') 
 iSeq = 1 + Indexx(String(sh3),"--NSGDG")/(Length(sh3)+1) 
         # identify which sequence contains the pattern

String ( ali tree )

Newick tree

http://evolution.genetics.washington.edu/phylip/newicktree.html

 
 read alignment s_icmhome+"sh3" 
 show String(sh3 tree)

Projecting properties from alignment to a member sequence.

String ( s_ali ali_from { seq | i_seqNumber } )

s_ali

ali_from.

seq_

s_ali

seq_.

 
 read alignment s_icmhome+"sh3"  # 3 seq.  
 cc = Consensus(sh3) 
 show String(Spec)//String(cc,sh3,Spec)

Projecting properties from member sequence to alignment

String ( s_seq { seq | i_seqNumber } ali_to s_gapDefChar )

seq_

ali_to

R_seq

seq_.

r_gapDefChar

l_showResCodeInSelection

 
 read alignment s_icmhome+"sh3"  # 3 seq.  
 ssFyn = Sstructure(Fyn) 
 set sstructure Spec String(String(ssFyn,Fyn,sh3,"_"),sh3,Spec) 
 show Spec

Returning one of two alternatives depending on a condition

String( l_condition s_choice1 s_choice2 )

This function is equivalent to the question mark operator in C, e.g. condition?choice1:choice2 Example:


a=3
String( a>1 , "big a", "small a" )

String( selection ): converting selections into the text form

String( { os | ms | rs | as } [ name | number ] [ i_number ] )

String( { os_1 | ms_onOneObj } simple )

converts a selection into a compact string form. Continuous blocks of selected elements in different molecules or objects are separated by vertical bar ( | ) which means logical or ( e.g. a_a.1:4|a_b.2,14 ) You can also divide this selection info a string array with the Split function.
Option i_number allows one to print only i-th element of the selection. It is convenient in scripts. For atom selections it will also show full information about each atom, rather than only the ranges of atom numbers.
This string form is convenient used for several purposes:

to store selections in tables and arrays.
to transfer selections from object to object and from session to session (see also the Select function)

name

a_/^F23

a_/23

number

name

Options:

simple : a special function that returns the selection in terms of the name of a single object (make sure that only one object is selected) and
name | number : allows one to show or hide, respectively, the one-letter code of a residue (e.g. '^W123'). the names of the molecules (e.g. a_1abc.a ). See also the Name function to get object of molecule names (e.g. Name( a_1. ) ↑ "1abc" )

 
 nice "1crn" 
 l_showResCodeInSelection = no 
 nei = String( Res(Sphere( a_/leu a_/!leu , 4.)) )   
 show nei 
 a_1crn.a/14:17,19:20 
 display xstick $nei

 
 read pdb "2ins" 
 for i=1, Nof( a_//c* ) 
    print String( a_//c* i ) 
 endfor

l_showResCodeInSelection

Retrieve window layout string from a slide

String( slide gui ) → s_layoutString

retrieves the string with the window layout information which is stored in the slide.

Example:


sl = Slide(gui)
undisplay window="all"
# take a look
display window=String( sl gui )

Sstructure

Sstructure ( rs )

`rs_

Sstructure ( { rs | s_seqStructure } compress )

Replace

 
 show Sstructure("HHHHHHH_____EEEEE",compress)  # returns string "HE"  
#    
 read object "crn" 
 show Sstructure( a_/A , compress)  # returns string "EHHEB"

Sstructure ( { seq | s_sequenceString } )

alignment of sequence group

seq_

set sstructure

make sequence ms_

delete sstructure

Sstructure

seq_

Frishman and Argos

vide infra

fasta

Pearson and Lipman, 1988

ktup=1

Method used for derivation of single sequence propensities.

Zhang et al., 1992

Turn

sequence)

Hutchinson and Thornton (1994)

M_out

seqLength].

M_out[1] : alpha-helix propensity [0., 1.]
M_out[2] : beta-sheet propensity [0., 1.]
M_out[3] : coil propensity [0., 1.]
M_out[4] : prediction reliability [0., 1.]

Sstructure ( seqarray )

sequence parray

Examples:

 
 show Index(Sstructure(a_1crn.,"HHHHHH"))  # first occurrence of  
                                           # helix in crambin  
 
 read sequence "sh3"    # load 3 sequences (the full name is s_icmhome+"sh3")  
 show Sstructure(Spec)  # secondary structure prediction for one of them 
 show Sstructure("AAAAAAAAAAAAA")  # sec. structure prediction for polyAla 
 
 read sequence "fasta_results.seq" 
 group sequences a unique 0.05     # remove redundant sequences 
 show Sstructure(my_seq_name)      # the actual prediction, be patient 
 plot number M_out display         # plot 3 propensities and reliability

Sum

[ Sum chemical | Sum image ]

Sum ( iarray )

Sum( { rarray | map })

real

Sum ( matrix )

Sum ( sarray [ s_separator ] )

concatenated

s_separator

Split

 
 show Sum({4 1 3})                     # 8  
 show Sum(Mass(a_1//*))                # mass of the first molecule  
 show Sum({"bla" "blu" "bli"})         # "bla blu bli" string  
 show Sum({"bla" "blu" "bli"},"\t")    # separate words by TAB  
 show Sum({"bla" "blu" "bli"},"\n")    # create a multiple line string

Sum( T_table { S_cols } s_sep1 s_sep2 ) → S_result

- returns sarray where each element formed as follows:

colname1sep2value1sep1colname2sep2value2 ...

Empty values are skipped. This function can be used to 'shrink' sparse tables

Example:


add column t {"" "a" "" "" "b"} {"c" "" "j" "r" ""} {"a" "u" "" "" "b"}
Sum( t ":", "=" )

Table

Table(ali I_aliPos residue|label) → T_posNumbers

`alignment-as-table{Table} (ali [number]) → T_seqColumns

Table( map ["min"|"max"] ) → T_x_y_z_value

`Table-matrix{Table} (matrix_nxm [S_colnames_m]) → T_with_m_columns

`Table-pairs{Table} ( matrix_nxm [ S_rowtags_n S_coltags_m ] index ) → T_nm_ij_pairs

Table( model term|merit) → T_statReport

Table( T S_Tcolnames) → T_columnSubseletion

Table( pairdist distance ) → T_atomsPairs

Table( parray [s_colName]) → T_with_parray_column

Table( collection ) → T_converted_from_the_collection

Table( residue ) → T_icm_res_names_codes

Table(seq) → T_resContent (name,n,freq)

Table(seq site) → T_siteInfo (key,fr,to,list,desc)

Table( stack [vs] ) → T_confTerms

Table( s_svgTextNodesEdges plot ) → T_nodes_edges

`Table-url{Table} (URL_encoded_string_a=b&c=d&e=ff [crypt] ) → T_name_value

- returns a table of icm residues from the icm.res file loaded by the read library command with the following columns:


.char .name .type .desc


t = Table(residue)
tt = t.type=="Amino"
show tt  # currently loaded amino acids
if Index(tt.name,"tyr") != 0 print "legal residue name"

Table: decoding a URL string

Table ( s_URL_encoded_String [ crypt ] )
- returns the table of "name" and "value" pairs organized in two string arrays. The URL-encoding is a format in which the HTML browser sends the HTML-form input to the server either through standard input or an environmental variable. The URL-encoded string consists of a number of the "name=value&name=value..." pairs separated by ampersand ( & ). Additionally, all the spaces are replaced by plus signs and special characters are encoded as hexadecimals with the following format %NN. The Table function decodes the string and creates two string arrays united in a table.
Option crypt allows one to interpret doubly encoded strings (e.g. ' ' is translated to '+' which then converted into a hexadecimal form). Frequently the problem can be eliminated by specifying the correct port. Example: you need to set a="b c" and d="<%>". Normal server will convert it to a=b+c&d=%3C%25%3E. Double encoding leads to a=b%2bc&d=%253C%2525%253E. To parse the last string, use the crypt option.
To see all the hidden symbols (special attention to '\r'), set l_showSpecialChar =yes.
Examples:

 
 read string       # read from stdin in to the ICM s_out string 
 a=Table(s_out)    # create table a with arrays a.name and a.value 
 show a            # show the table 
 for i=1,Nof(a)    # just a loop accessing the array elements 
   print a.name[i] a.value[i] 
 endfor

Getenv( )

Converting alignment into a table

Table ( alignment [ number ] )

- returns the table of relative amino acid positions for each of the sequence in the alignment. Gaps are marked by zero. Note that here columns correspond to different sequences while rows correspond to alignment positions. In the next function this order is reversed. The first column of the table, .cons , contains sarray of consensus characters. All the other arrays are named according to the sequence names by default, or by the sequential number of a sequence in the alignment, if option number is specified. The table may be used to project numbers from one sequence to another. See also the Rarray( R_, ali_, seq_ ) function.

This table may look like this:

 
#>T pos 
#>-cons----seq1-----seq2------- 
   " "       0        1           
   " "       0        2           
   C         1        3           
   " "       2        0           
   ~         3        4           
   C         4        5           
   " "       0        6           
# for the following alignment: 
# Consensus    C ~C   
seq1         --CYQC-  
seq2         LQC-NCP

exact

 
 read alignment "sh3"    
 t = Table(sh3 number)  # arrays t.1 t.2 t.3  
 t = Table(sh3)         # arrays t.cons t.Fyn t.Spec t.Eps8  
# 
 cc = t.cons ~ "[A-Z]"  # all the conserved positions  
 show cc                # show aa numbers at all conserved positions 
 show t.Fyn>=10 & t.Fyn<=20 # numbers of other sequences in this range

Outputting a table of residue numbers for corresponding positions

Table( alignment, I_alignmentPositions , residue | label )

table

I_alignmentPositions

residue

label

Y25

If an alignment is linked to a 3D molecule, all cell of this row will show both sequence numbers, as well as residue numbers of the linked 3D molecule, see example below. The columns names are composed of letter 'p' for position and alignment position (eg p11, p12 .. )

Note that in contrast to the previous function, this function looks like an alignment and has the same orientation. Each row corresponds to a different sequence, the sequence name is stored in the first column, while other columns contain residue numbers in the selected alignment positions.

Example:


 Table( aaa {1 11 13 16} label ) # aaa contains three sequences

seq	p1	p11	p13	p16	comment
1fyn_a		L10,90	D12,92	A15,95	this sequence is linked to molecule a
Spec	D1	L11	D13	E16	Spec sequence positions starting from 1
Eps8	K1	K11	D13	A16	Eps8 sequence positions starting from 1

Converting matrix into table columns

Table ( matrix [ S_colnames ] ) → T

- returns table with matrix columns named 'A', 'B', .. or according to the second argument. Example:


 t= Table(Matrix(3),Sarray(3,"A")+Count(3))
 show t
 >T t
 #>-A1---A2---A3----
    1.   0.   0.         
    0.   1.   0.         
    0.   0.   1.

The inverse operation can be done with Matrix ( table , S_colNames ) function.

Converting a square symmetric matrix to a table with pairs of elements.

Table(matrix_nxm [S_rowtags_n S_coltags_m] index ) → T_nm_ij_pairs

This function will return a table with three or five columns, named I,J,C or A,B,I,J,C containing a two indexes and (if provided) two names of elements and their Mij value. It will return all values. It the

Example:

build string "ala" m=Distance(Xyz(a_//c*) Xyz(a_//c*) ) # carbon distance matrix ats = Sarray( a_//c* ) t = Table(m ats ats index) add column t t.I-t.J name={"D"} delete t.D<=0 # get rid of the diagonal and lower triangle show t # another example: m = Random(2,4,1.,3.) # matrix 2x4 r= {'a','b'} # row tags c= {'u','x','y','z'} # col tags Table( m r c index)

Extracting parameters of stack conformations

Table( stack [ vs ] )
- return table of parameters for each conformation in a stack . If a variable selection argument is provided, the values of the specified variables are returned as well.

 
% icm 
 build string "ala his trp" 
 montecarlo 
 show stack 
 iconf>       1       2       3       4       5       6       7 
 ener>    -15.1   -14.6   -14.6   -14.2   -13.9   -11.4    -1.7 
 rmsd>      0.3    39.2    48.0    44.1    27.4    56.6    39.3 
 naft>        1       0       0       1       1       1       0 
 nvis>        4       1       1       4       4       4       1 
t= Table(stack) 
show t 
 #>T t 
 #>-i--ener--------rmsd--------naft--------nvis------- 
    1  -15.126552  0.295555    1           4 
    2  -14.639667  39.197378   0           1 
    3  -14.572973  47.996203   0           1 
    4  -14.220515  44.058755   1           4 
    5  -13.879041  27.435388   1           4 
    6  -11.438268  56.636246   1           4 
    7  -1.654792   39.265912   0           1 
t1= Table(stack v_//phi,psi)  # show also five phi-psi angles 
 #>T 
 #>-ener----rmsd--naft-nvis------v1------v2------v3------v4-------v5----- 
    1 -15.12   0.29   1   4     -79.10  155.59  -75.30  146.99  -141.13 
    2 -14.63  39.19   0   1    -157.22  163.56  -78.25  139.51  -137.30 
    3 -14.57  47.99   0   1    -157.26  166.87  -85.08   92.55   -84.74 
    4 -14.22  44.05   1   4     -67.65   80.43  -76.67  103.05   -81.85 
    5 -13.87  27.43   1   4     -82.72  155.86  -85.02   93.11   -81.46 
    6 -11.43  56.63   1   4     -78.28  152.80 -154.79   66.26   -77.61 
    7  -1.65  39.26   0   1     -78.17  169.41 -133.89   96.39   -96.03

Iarray stack

Table: resorting nodes and edges generated by graphviz-dot neato tool with svg output.

Table ( s_graphviz_svg plot ) → T_nodes_edges_for_resorting

takes the svg output of the neato tool from the graphviz dot package and parses it into rows for resorting to solve the problem of lines overlapping the nodes. The table contains the following columns:

i : original order
tx : svg text corresponding to header/footer or a node/edge row
ty : type of the row: one of four: "begin","node","edge","tail"
width : a number for sorting. Attempts to find tags: stroke-width and put thick lines on top.

Example:


read string "/tmp/sgraph.svg" name="svg" # original svg with overlaping edges.
tsvg = Table(svg, plot)
sort tsvg.width
write Sum(tsvg.tx) "/tmp/sgraph_sorted.svg"

Column weights in PLS models and relative contributions of the descriptors.

Table( plsModelName [ term | merit ] )

returns a table with three columns: name mean rmsd, -w (weight) and -wRel columns. The header of the table contains the free term ( constant b ). The linear model can be represented as Ypred = b + w_{1*X_{1+w_{2*X_2+...The}}} wRel column returns the following value:( Abs(w_k) * Rmsd(X_k) ) / Sum_k( Abs(w_k) * Rmsd(X_k) )

Example:


A = Random(1. 10. 20)
group table T A A*2. "B" Random(1. 10. 20) "C" Random(1. 10. 20) "D" 
write binary Apred
delete Apred
#
read binary "Apred"
Table( Apred term )
 #>r .b
   0.012402
 #>r .self_R2
   0.999998
 #>r .test_R2
   0.999885
 #>r .self_rmse
   0.002908
 #>r .test_rmse
   0.030066
 #>T
 #>-name--------mean--------rmsd--------w-----------wRel-------
    B           11.04767    4.620291    0.499992    99.726648
    C           5.749607    2.681675    -0.001182   0.13679
    D           4.686537    2.686346    -0.001178   0.136562

Column weights in PLS models and relative contributions of the descriptors.

Table( s_buildInModel|F_model X_chemarray [inverse] )

Returns table with the following columns

name: descriptor name or SMARTS definition for the fragment from the fingerprint
w: weight coefficient
wRel: relative weight coefficient
ch_#: for each chemical from the X_chemarray a column contains number of matches of the chain or descriptor value for numerical descriptors.

inverse option returns fragments which are not present in the model. ~w and ~wRel columns are omitted in this case.

Example:


tt = Table( "MolLogP", Chemical( "CCO" ) )
tt.ch_1 != 0
Table( "MolLogP", Chemical( "OOO" ) inverse )
tt_stat = Table( myModel, tt.mol inverse )

Exporting a table from an interatomic distance object of hydrogen bond parray.

Table( hbondpairs|atompair_distances|angles|torsions distance ) → T_atomsPairs

- takes a distance object and returns a table with the following columns


 atom1 # selection , e.g. a_a.b/^T3/cn
 atom2 # second atom
 dist  # distance in Angstroms
 color # color if present
 label # label of this distance

atom3

atom4

Example in which we find the shortest hydrogen bond in crambin:


read pdb "1crn"
convertObject a_ yes yes no no
make hbond  name="hbonds_crn"
show Nof( hbonds_crn ) # counts distances
t = Table( hbonds_crn distance )
sort t.dist
show t[1]

See also: make distance , make hbond , Nof-distance{Nof(d,distance)}

Generate plot of pronotation states concentration vs pH

Table ( chemical charge )

- returns the table with concentrations of various prononation state vs pH

Table( chemical r_pH r_concCutoff charge )

- returns the table with protonation states at given ph4 and percentage cutoff

Tan

Tan ( { r_Angle | i_Angle } )

real

Tan ( rarray )

 
 show Tan(45.)               # 1.  
 show Tan(45)                # the same  
 
 show Tan({-30., 0. 60.})    # returns {-0.57735, 0., 1.732051}

Tanh

Tanh ({ r_Angle | i_Angle } )

real

Tanh ( rarray )

 
 show Tanh(1)                 # returns 0.761594  
 show Tanh({-2., 0., 2.})     # returns -0.964028, 0., 0.964028

Tensor

Tensor ( M)

M_ki (k=1,K,i=1,N)

NxN

< X_i >< X_j > - < X_i X_j >

k=1

 
 Transpose( xyz ) * xyz / Nof(xyz)

In one-dimensional case, N=1, when M_ is just one column (k=1,K; i=1,1) the function returns a one by one matrix with the mean-square-deviation of the vector (which is equal to Rmsd(R_)*Rmsd(R_)).
N=2, x and y dimensions; In this case the function returns the 2 by 2 matrix: with <x>²-<x²> and <y>²-<y²> on the diagonal and <x><y>-<xy> off-diagonal elements.
In three-dimensional case the function returns three by three tensor of inertia (it was too tiring to type the formula in html). This matrix is useful for superposition of bodies or molecules on the basis of shape, since three principal coordinates can be easily derived from the tensor using the Eigen or Disgeo functions. This trick used in the dockScan script ( _dockScan.

 
 build string "AAA"       # a long molecules  
 xyz = Xyz( a_//c* )  # a coordinate matrix of carbons 
     # you can also do it with grobs: xyz = Xyz( g_myGrob ) 
 a=Tensor(xyz)        # compute 3 by 3 matrix of the second moments 
 b=Eigen(a)              # returns 3 axis vectors 
 ax1= b[?,1] # this is the longest half axis 
 ax2= b[?,2] # this is the second half axis 
 ax3= b[?,3] # this is the shortest half axis 
 len1 = Length(ax1)  # long axis length 
 len2 = Length(ax2)  # mid  axis length 
 len3 = Length(ax3)  # short axis length 
 r = Matrix(3,3) 
# to make the rotation matrix from b normalize the axes  
   r[?,1] = ax1 / Length( ax1 ) 
   r[?,2] = ax2 / Length( ax2 ) 
   r[?,3] = Vector( r[?,1], r[?,2] ) 
   rotate a_ Transpose(r)  # rotates the principal axes to x,y,z 
# x the longest

calcEllipsoid M_xyz

ellipseRotMatrix

ellipseAxis1

ellipseAxis2

ellipseAxis3

Rot

rotate

transform

 
 build string "se ala ala ala ala"  # let is define the ellipsoid 
 display virtual 
 a = Tensor(Xyz(a_//!h*))  # Xyz returns matrix K by 3 
 b=Eigen(a)                # 3x3 matrix of 3 eigenvectors 
 b[?,1] = b[?,1] / Length(  b[?,1] ) # normalize V1 in place 
 b[?,2] = b[?,2] / Length(  b[?,2] ) # normalize V2 
 b[?,3] = Vector( b[?,1], b[?,2] )   # V3 is a vector product V1 x V2 
 rotate a_ Transpose( b )  # b is the rotation matrix now 
  # Transpose(b) is the inverse rotation 
 set view  # set default X Y Z view

Temperature

Temperature ( { s_DNA_sequence | seq_DNA_sequence } [ r_DNA_concentration_nM [ r_Salt concentration_mM ] ] )

real

Tm=DH/(DS + R ln(C/4)) -273.15 + 16.6 log[K+]

C

[K+]=

Time

Time ( string )

00:12:45

Time ( )

real

 
 if (Time( ) > 3660.) print "Tired after " Time(string) " of work?"

Tointeger

convert to integer values or arrays.

Tointeger ( string|real|integer|logical ) - converts to integer

Tointeger ( sarray|rarray|iarray|array ) - converts each element to integer, returns iarray.

Tointeger ( R_source R_splitPoints I_values ) - maps real numbers from the R_source to integers. The R_splitPoints array of a size n should contain numbers in increasing order. Those n points will be used as split points for n+1 intervals. I_values of size n+1 specifies numbers to be assigned to values in each of those intervals.

Example in which we form two classes for positive and negative values. Useful, e. g. in classification problems .


Tointeger( {-1., -2., 3. 4. 5. 6.},{0.},{-1,1} )
 {-1, -1, 1, 1, 1, 1}

A more general splitter:


Tointeger({1. 2. 3. 4. 5. 6.},{2.5,4.5},{2,4,6})

Tointeger ( S_source S_labels I_values )

Tointeger ( I_source I_labels I_values )

- these functions recode source, replacing each value found in labels array by the respective value from the values array. Thus, values array should have the same number of elements as the labels array. Alternatively, it may contain an extra element, and that last element will be interpreted as the default value for everything from the source not listed in labels.

Example:


Tointeger( {"dit" "dah" "dah" "dah" "dit" "dah"} {"dit" "dah"} {0 1} )
 0 1 1 1 0 1
Tointeger( {"dit" "dah" "dah" "XXX" "dit" "dah" "YYY" "dah"} {"dit" "dah"} {0 1 100} )
 0 1 1 100 0 1 100 1
Tointeger( {1 5 1 5 6 7 6 1} {1 5} {2 3 0} )
 2 3 2 3 0 0 0 2

Toreal

convert to real values or arrays.

Toreal ( string|real|integer ) - converts to real
Toreal ( sarray|rarray|iarray ) - converts each element to a real, returns a real array. see Rarray( ).

Toreal ( S S_n_keys R_n1_values ) # R_values has n or n+1 elements - converts each key to a respective real value. If values contains n+1 elements, the last value is the deault value (used to convert all keys not in keys).

Example:


Toreal({"c","a","c","c"},{"c","a"},{1,2}) # two classes
 {1, 2, 1, 1}
Toreal({"c","a","q","c","k"},{"c","a"},{1.8,2.3,0.5}) #with default value 0.5
 {1.8, 2.3, 0.5, 1.8, 0.5}

Support for special values in real arrays.Section rarray constant describes special values in real arrays that may appear in real columns of tables upon reading the Excel/csv files or property fields of the mol (or sdf) . Example: Create file 't.csv' that looks like this:


1.1
ND
3.3
INF
>3.
<2.


  read csv "t.csv" 
  t.A == Toreal({"ND"})
  t.A != Toreal({"ND","INF"})
  t.A == Toreal({">3."})

Tostring

convert to integer values or arrays.

Tostring ( string|real|integer ) - converts to string

Tostring ( sarray|rarray|iarray ) - converts each element to a string, returns sarray.

Tostring ( seqarray ) - returns sarray with sequences extracted from sequence parray elements.

See also: Toreal , Tointeger , Sequence

Toupper

convert to the uppercase.
Toupper ( string ) - returns the string converted to the uppercase. The original string is not changed
Toupper ( sarray ) - returns the sarray converted to the uppercase. The original sarray is not changed.

Toupper ( string|sarray 1 )
Examples:

 
 show Toupper("promotion") 

 show Toupper("joseph louis gay lussac",1) 
 Joseph Louis Gay-Lussac
 
 read sarray "text.tx" 
 text1 = Toupper(text)

Tolower

Tr123

Tr123 ( sequence )

 
 show Tr123(seq1)

Tr321

IcmSequence

Tr321

Tr321 ( s )

sequence

Tr123

 
 show Tr123("ala his hyp trp")  # returns AHXT

Trace

Trace ( matrix )

real

 
 show Trace(Matrix(3))     # Trace of the unity matrix [3,3] is 3.

Trans

[ Dna translate ]

Trans ( R_12transformationVector )

R_3

transformation vector

Six frame DNA/RNA sequence translation

Trans ( seq_DnaOrRnaSequence )

Sequence

seq_

reverse

 
 w=Sequence("CGGATGCGGTGTAAATGATGCTGTGGCTCTTAAAAAAGCAGATATTGGAG") 
 show Trans(w), Trans(w[2:999]),Trans(w[3:999]) 
 c=Sequence(w,reverse) 
 show Trans(c), Trans(c[2:999]),Trans(c[3:999])

Trans ( seq_DnaOrRnaSequence { all | frame } [ i_minLen] [ s_startCodons] )

table

i_minLen

s_startCodons

"ATG"

"ATG

frame

all

i_minLen

frame - integer 1 2 3 for the direct chain, or -1, -2, -3 for the complementary chain, respectively
left - translation offset in the direct strand (even if translation occurred in the complementary chain)
right - translation offset in the direct strand.
dir - direction (+1 for the direct, -1 for the complementary)
len - fragment length
seq - sequence string

 
#>-frame-------left--------right-------dir---------len---------seq-------- 
   -1          22          57          -1          12          XCVXVAAESVAS

 
  dna=Sequence("TTAAGGGTAA TATAAAATAT AAAGTTCGAA CAATACCTCA CTAGTATCAC AACGCATATA") 
  T=Trans(dna frame 10) 
  sort T.left 
  show T

Transform family of functions.

Transform( s_group|iGroup|os_1|map ) → R_12N_all_fract_transformations

Transform( s_group|iGroup|os_1|map iTrans ) → R_12_fract_transformation_i

Transform( s_group iTrans R_6cell ) → R_12_abs_transformation_i

Transform( obj "bio" i_biomol ) → R_12N_abs_BIOMT_transformations

Transform( s_symbolic_transformation ) → R_12 # not ready

Transform( R_6 ) → R_12

Transform( M_4x4 | M_3x3 ) → R_12_transformation

Transform( R_12 inverse ) → R_12_inverse_transformation

returns one or n transformations in the form of one 12*n long vector. Here os_1 means selection of one single object (e.g. a_ for the current object). The crystal symmetry and the biological symmetry can be imposed with the set symmetry command.

Transpose

matrix function.
Transpose ( matrix ) - converts the argument matrix[n,m] into the transposed matrix [m,n]
Transpose ( rarray ) - converts real vector [n] into a one-column matrix [n,1]
Examples:

 
 Transpose(a)              # least squares fit  
 Transpose({1. 2. 3.})     # [3,1] matrix

Transpose( table [i_nameColumn] ) - converts the argument table[nrows,ncols] into the transposed table [ncols,nrows]

All columns in the result table will be assigned the same type which is determined from column types of the source table. The result type can be either iarray, rarray or sarray.

Optional argument i_nameColumn specifies the column number in the source table which will be excluded from the transposition and it's values will be used to assign column names in the result table.

Example:

read sequence "seqs" group sequence "a" align a t = Table( a Count(Length(a)) label ) t2 = Transpose( t 1 ) # transpose and use first column values as result column names (t2.Azur_Alcfa == "-").Azur_Alcde # get residue labels in Azur_Alcde which corresponds gaps in Azur_Alcfa

Trim

Trim ( R [ r_percentile [ i_mode ]] )

r_percentile

i_mode

d_new = b + log(1.+(d-b)/(b-a))

Return values:

The function returns an array with corrected outliers.
The adjusted boundaries are returned in the r_out and r_2out values.
The number of outliers is returned in i_out .

r_percentile

i_mode

 
 Trim({0. 1. 4. 6.})     # keeps values unchanged 
 Trim({0. 1. 4. 6. 55.},0.9,1) # returns {0. 1. 4. 6. 11.3} 
 Trim({-33. 0. 1. 4. 6. 55.},0.9,1) # returns {-3.5 0. 1. 4. 6. 11.3}

Trim ( R rainbow|fix ) - a linear transformation to the [0., 1.] range, useful for generating a rainbow color index. The fix option transforms to a fixed range of [-1.,1.], useful for machine learning.
Trim ( I_iarray i_lower i_upper ) - returns iarray clamped into the specified range. Values smaller than i_lower are replaced with i_lower, and values greater than i_upper are replaced with i_upper.
Trim ( R_rarray r_lower r_upper ) - returns rarray clamped into the specified range.
Trim ( i i_lower i_upper ) - returns integer clamped into the specified range (e.g. Trim(6,1,3) returns 3).
Trim ( r r_lower r_upper ) - returns real clamped into the specified range.
Trim ( M_matrix r_lower r_upper ) - returns matrix clamped into the specified range.
Trim ( m_gridMap r_lower r_upper ) - returns map clamped into the specified range. It means that all values above r_upper are set to r_upper, and all values below r_lower are set to r_lower.
Trim ( string|S [ all | print ] ) - returns string (or sarray) with removed trailing blanks and carriage returns. If option all is specified, both leading and trailing blank characters will be removed. With print option the non-printable characters will be removed or replaced by similar printable characters.
Trim ( string maxNofCharacters [s_appendWhenTruncated] ) - trims to the maximal number of characters, it may appends specified trailing string if truncated.


Trim("123456",3) # returns "123"
Trim("123456",33) # returns "123456"
Trim("123456",3,"..") # returns "123.."

Trim ( string s_allowed_characters )

- returns string with in which only the allowed characters are retained. All other characters are removed. Example:

 
Trim("as123d","abcds")  
 asd

Trim ( string S_regularExpressionsToDelete )

- returns string in which all listed regular expressions are deleted.
Trim ( S_sarray [ all ] )

- returns sarray of strings with removed trailing blanks. With option all it removes white space characters from both ends.

Trim decorations from a chemical

Trim( X [s_smarts ('[$([*;D1]~[*;R0])]') [i_maxSteps(999) i_minAtomsLeft (0)] ] ) → X_trimmed

iteratively identifies the smarts patterns and deletes it. Arguments:

s_smarts : the pattern identified. The default '[$([*;D1]~[*;R0])]' requests a non-ring neighbor, it means that it will leave one atom attached to the scaffold. If you want to shave all chain decorations, use "[*;D1]"
i_maxSteps : defines the number of iterations (default 999)
i_minAtomsLeft : defines the minimal number of atoms left (default 0). Note that D1 will stop when there is a single carbon left, thus it will never go to zero


add column t Chemical({"C1C(CCNC)CNC1CCCC","",""} )
t.mol[2] = Trim( t.mol[1],"[*;D1]" ) 
t.mol[3] = Trim( t.mol[1] )  # the default will leave one attached atom

See also:

Trim irrelevant expression tag sequences from the domain sequence.

Trim ( seq S_tagRegexps ) → seq_truncated

This function finds the matching regular expressions in the source sequence and deletes it. Note that the order is important and the longer patterns need to precede the shorter ones. The pattern can be N-terminal (use ^) , a fragment in the middle, or C-terminal (use dollar $ ) There is a built in shell array called S_proteinTags that contains popular expression tags:


^.{0,11}HHHHHH
^.{0,5}HHHHH
^.{0,5}DYKDDDDK
DYKDDDDK.{0,3}$
HHHHHH.{0,6}$
YPYDVPDY.{0:3}$
AWRHPQFGG$


 read pdb sequence "1pme"  # contains his-tag
 cleanseq = Trim(1pme_a S_proteinTags ) # built in shell array
 Align(1pme_a cleanseq)
 1pme_a        MSSSHHHHHHSSGLVPRGSHMAAAAAAGAG
 cleanseq      ----------SSGLVPRGSHMAAAAAAGAG

Turn

Turn ( { seq | rs } )

Hutchinson and Thornton (1994)

 
 s =  Sequence("SITCPYPDGVCVTQEAAVIVGSQTRKVKNNLCL") 
 plot comment=String(s) number Turn(s) display # plot Turn prediction

predictSeq

Type

[ Type soap | Type molcart ]

Type ( icm_object_or_keyword )

Type(4.32)

Type(tzWeight)

"real"

"integer", "real", "string", "logical", "iarray", "rarray", "sarray","table", "aselection","vselection","sequence",

"alignment", "profile", "matrix", "map", "grob", "command", "macro", "unknown".

Type (parray , 1 )

parray

"mol"

"model".

Type ( as , 1 )

Type ( os_object , 2 )

- returns a string (or an sarray with keyword object) containing the os_object (or current by default) molecular object type. Defined types follow the EXPDTA (experimental data) card of PDB file with some exceptions, see below:

"ICM" ready for energy calculations. Those objects are either built in ICM or converted to the ICM-type.
"X-Ray" determined by X-ray diffraction
"NMR" determined by NMR
"Model" theoretical model (watch out!)
"Electron" determined by electron diffraction
"Fiber" determined by fiber diffraction
"Fluorescence" determined by fluorescence transfer
"Neutron" determined by neutron diffraction
"Ca-trace" upon reading a pdb, ICM determines if an object is just a Ca-trace.
"Simplified" special object type for protein folding games.
The non-ICM types can be converted to "ICM" with the convert command or convertObject macro.

The non-ICM types can be changed with the set type object command, e.g.


set type a_ "NMR"

Type ( { ms | rs }, 2 ) - returns the string type of the specified molecule or residue. Legal types are "Amino", "Hetatm", "Nucl", "Sugar", "Lipid", "empty". Residues of the "Amino" type can be selected with the 'A' character (e.g. a_/A). See also a one-letter code for the type which is used in selections, ( e.g. a_A,H ).

The molecule type can be reset with the set type ms_ s_type command, e.g. ( set type a_2 "H" to switch to a heteroatom type. Examples:

 
 if (Type(a_1.1)!="Amino") goto skip:      # deal only with proteins  
 if (Type( ) == "NMR") print "Oh, yes!"

Type( ms molecule|all )

Type( rs residue|all )

Type ( as { atom | mmff } )

 
 build string "his ala" 
 show Type(a_//!vt* atom )  # icm types for non-virtual atoms

Type ( as_1 as_2 )