|
This script scores ensemble clusters previously generated with
enscluster.pl. As with the other utilities
for ensembles a tag is required for identifying
the structure set and the ensemble directory may be given with
-dir.
By default the final clusters at the bottom of the cluster
hierarchy are scored. Alternatively, clusters at a specific level
may be scored instead by specifying the level through the
-level option.
The total energy (etot) is used for scoring as the default
property. A different property may be chosen with -prop.
A number of different methods are available to obtain a single
score for each cluster from the property values of the cluster members.
The default method is to calculate a simple average for all members.
Other methods are available with the -crit option followed
by a corresponding keyword. With avglow and avgcent
cluster members with a property value outside the standard distribution around
the mean are ignored in calculating the average property value. If
avglow is selected only values on the high end of the distribution
are ignored, with avgcent extreme values on both sides of the
distribution are omitted from the calculation of the average. avglow
is particularly useful with energy values if a small number of erroneously
high energies occur due to structural distortions that are not resolved
during minimization. If avglow or avgcent is selected
the option -limstd may be used to change the limit in multiples of
the standard deviation for excluding cluster members.
The best value is used as the score with best, the average
over the best num structures with best. Finally,
with median the median value is taken as the cluster score.
In the output the clusters are sorted according to the score (or according
to their size if -size is given). For each cluster it consists of
the number of total members, the number of members used in calculating the score,
the score itself and, if applicable, the standard deviation and the statistical
error of the score based on the standard deviation and the number of values used
in calculating the score. If the option -lowest is given additional fields
contain the energy and filename of the lowest energy conformation for each cluster.
|