The Cluster Package enables one to generate clusters of compounds based on the Daylight Fingerprint descriptor and the Jarvis-Patrick clustering algorithm. Subsets of large datasets can be selected as well as clustering data added to TDT files for insertion into Daylight Databases. Keep track of files from this exercise for use in Day 2 labs.
$DY_ROOT/bin/smi2tdt -t '$SMI' cluster.smi cluster.tdt
fingerprint -b 1024 -c 1024 -id day1 cluster.tdt >cluster.fp.tdt
nearneighbors -NEIGHBORS 5 cluster.fp.tdt cluster.nn.tdt
jpscan -NN_BEST_THRESHOLD 0.7 -JP_NEAR 5 cluster.nn.tdt jpscan.out
showclusters
jarpat -JP_NEED 3 -JP_NEAR 5 cluster.nn.tdt >cluster.cl35.tdt
showclusters -h -q -v cluster.cl35.tdt >cluster.cl35.out
listclusters -a cluster.cl35.tdt >cluster.lc.tdt