Assembling off-the-shelf Components Into Useful Applications,
TJ O'Donnell, MUG2004
tdt2histo and filter_dayprop
The tdt2histo utility reads TDTs containing
numerical data and produces HTML tables
showing histogram bin counts of the data values.
tdt2histo is a Perl function meant to be "required"
by a Perl main program.
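The binning idea can be illustrated with a minimal sketch. It is written in Python for brevity and is not the author's Perl code. It assumes equal-width bins spanning the minimum to the maximum value, with a final bin that holds only the maximum; that rule is a guess based on the eleven bin labels shown for real-valued properties in the sample output. The integer-valued properties there appear to use whole-number bin widths, which this sketch does not attempt to reproduce. The names histogram and to_html are hypothetical.

```python
def histogram(values, intervals=10):
    """Count values into equal-width bins spanning min..max.

    Returns (lower_edge, count) pairs. There are intervals + 1 entries;
    the last entry counts only values equal to the maximum.
    This binning rule is an assumption, not tdt2histo's documented behavior.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: a single bin holds everything.
        return [(lo, len(values))]
    width = (hi - lo) / intervals
    counts = [0] * (intervals + 1)
    for v in values:
        if v == hi:
            idx = intervals  # the maximum gets its own final bin
        else:
            # Clamp to guard against floating-point rounding near the top edge.
            idx = min(int((v - lo) / width), intervals - 1)
        counts[idx] += 1
    return [(lo + i * width, counts[i]) for i in range(intervals + 1)]


def to_html(tag, bins):
    """Render one histogram as a simple HTML table (hypothetical layout)."""
    total = sum(count for _, count in bins)
    rows = "".join(
        f"<tr><td>{edge:g}</td><td>{count}</td></tr>" for edge, count in bins
    )
    return f"<table><caption>{tag} {total}</caption>{rows}</table>"


if __name__ == "__main__":
    demo = [0, 0.5, 1, 2.5, 9.9, 10]
    print(to_html("DEMO", histogram(demo)))
```

Giving the maximum its own bin keeps every label a lower edge, at the cost of one extra, usually sparse, row.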
The filter_dayprop utility
converts the generic PPROP tags output by dayprop into
more meaningful names, such as AVERAGE_MOL_WT.
filter_dayprop is a Perl application.
The following command sequence dumps the nci00demo database,
computes all the dayprops, and adds meaningful tag names:
$DY_ROOT/bin/thorlist nci00demo | $DY_ROOT/bin/dayprop -property ALL | filter_dayprop > nci00demo_props.tdt
Sample output from tdt2histo
bin | count
116.20 | 585
425.83 | 992
735.46 | 240
1045.09 | 51
1354.72 | 13
1664.35 | 5
1973.98 | 1
2283.61 | 0
2593.24 | 0
2902.87 | 0
3212.5 | 1

AVERAGE_MOL_WT 1937
bin | count
45.10 | 468
184.191 | 1055
323.282 | 310
462.373 | 78
601.464 | 17
740.555 | 6
879.646 | 2
1018.737 | 0
1157.828 | 0
1296.919 | 0
1436.01 | 1

HDONOR_COUNT 1937
bin | count
0 | 1273
2 | 501
4 | 122
6 | 38
8 | 2
10 | 0
12 | 1

RIGIDITY 1937
bin | count
0.1562 | 28
0.24058 | 130
0.32496 | 266
0.40934 | 327
0.49372 | 352
0.5781 | 257
0.66248 | 193
0.74686 | 113
0.83124 | 42
0.91562 | 20
1 | 209

FRAGMENT_COUNT 1937
bin | count
0 | 1069
3 | 758
6 | 93
9 | 10
12 | 5
15 | 0
18 | 0
21 | 1
24 | 1

PART_COUNT 1970
bin | count
1 | 1937
2 | 33

FLEXIBILITY 1937
bin | count
0.00 | 469
0.095 | 530
0.19 | 364
0.285 | 194
0.38 | 147
0.475 | 93
0.57 | 53
0.665 | 44
0.76 | 30
0.855 | 12
0.95 | 1

ACCURATE_MASS 1937
bin | count
45.057850 | 470
184.0325215 | 1054
323.007193 | 309
461.9818645 | 78
600.956536 | 17
739.9312075 | 6
878.905879 | 2
1017.8805505 | 0
1156.855222 | 0
1295.8298935 | 0
1434.804565 | 1

RING_COUNT 1937
bin | count
0 | 292
1 | 630
2 | 550
3 | 274
4 | 127
5 | 41
6 | 17
7 | 6

ROTBOND_COUNT 1937
bin | count
0 | 1677
8 | 204
16 | 41
24 | 10
32 | 2
40 | 1
48 | 1
56 | 0
64 | 0
72 | 1

MOLAR_VOLUME 1888
bin | count
56.00 | 651
212.1 | 952
368.2 | 221
524.3 | 45
680.4 | 15
836.5 | 2
992.6 | 0
1148.7 | 1
1304.8 | 0
1460.9 | 0
1617 | 1

POLAR_SURFACE_AREA 1937
bin | count
0.00 | 839
42.044 | 760
84.088 | 246
126.132 | 66
168.176 | 15
210.22 | 3
252.264 | 6
294.308 | 1
336.352 | 0
378.396 | 0
420.44 | 1

STEREOCENTER_COUNT 1937
bin | count
0 | 1741
3 | 126
6 | 39
9 | 26
12 | 2
15 | 1
18 | 1
21 | 1

HACCEPTOR_COUNT 1937
bin | count
0 | 1021
6 | 753
12 | 126
18 | 24
24 | 6
30 | 5
36 | 1
42 | 0
48 | 1

ATOM_COUNT 1937
bin | count
3 | 489
13 | 1061
23 | 307
33 | 61
43 | 11
53 | 6
63 | 1
73 | 0
83 | 0
93 | 1
1971