Daylight Summer School 2001, June 5-7, Santa Fe, NM

Daylight Worksheet - Cluster Package -- SHOW HINTS!


The Cluster Package enables one to generate clusters of compounds based on the Daylight Fingerprint descriptor and the Jarvis-Patrick clustering algorithm. Subsets of large datasets can be selected as well as clustering data added to TDT files for insertion into Daylight Databases. Keep track of files from this exercise for use in Day 2 labs.

  1. Generate a TDT file containing a clustered dataset from "cluster.smi" dataset which uses fixed length fingerprints 5 nearest neighbors and tanimoto threshold of 0.7, and a "reasonable" JP clustering level chosen from jpscan output.

  2. Pick a representative subset of the clustered dataset from step one by selecting only the cluster centroids and the singletons.

    listclusters -a cluster.cl35.tdt >cluster.lc.tdt

  3. Update the nearneighbors table generated from the cluster.tdt dataset with the "drugs.smi" dataset fingerprinted with the same parameter set used in step one.


Daylight Chemical Information Systems Inc.
support@daylight.com