Senomyx
Mission:
To discover new and improved flavor and fragrance molecules through integrating our enhanced understanding of the biology of taste and smell with high throughput molecular discovery technology.
A
rapid, iterative process based on a system of tightly integrated and automated
just-in-time components of molecular discovery discipline.
• Knowledge-based Computational Library Design
• High Throughput Synthesis
• Analysis and Purification
• Assay Development
• High Throughput Screening
Informatics
Foundation and infrastructure:
• Oracle Database (Senobase)
• Daylight Oracle Cartridge and toolkits
• Modular server-side software components (VB, Java)
• Web-based user interfaces
• Numerous behind the scenes middleware that sets the foundation
for rapid application development.
·
Structure/substructure
searching
·
List
generation
·
List
management
·
Method
writer
Computational
molecular design:
·
Diversity
space is limited with matrix synthesis
·
Goal
is to expand diversity space through cherry picking
·
Selection
from virtual library
·
2-D
topology-based methods
·
Nearest
neighbor approach
·
Genetic
algorithm used for searching
·
Cherry
picking hundreds for synthesis
·
Fully
integrated into workflow
Virtual
libraries:
·
Defined
by a generic reaction (A+B+C => ABC)
·
Scope
of viable reagents
·
Novelty
and relevance
·
The
sum of all virtual libraries constitutes the search space for target guided
design
1. Start with a randomly generated seed
population of n candidate reactions or from a pre-existing population
2. Repeat the following steps until n offspring have been created:
·
Select
a pair of parent candidates from the current population
·
Apply
a singe-point crossover operator in a chemically meaningful way; keep one new
offspring randomly
·
Mutate
the offspring with specified mutation probability
3. Combine new and old populations
4. Calculate the fitness of each candidate in the combined
population
5. Sort population by fitness and cull the less fit half.
6. Go to step 2.
•Maximize
activity in biological assay
Use a “training set” of compounds whose activities have been measured in a biological assay. Estimate the activity of each new compound by selecting its 10 nearest neighbors from the training set and taking a weighted average of their activities. Nearest neighbors are determined by Tanimoto coefficients of similarity for 2D binary fingerprint comparison.
•Maximize
population diversity
Diversity of a given set of compounds D(S) is estimated as
where n is the number of compounds in S and d
is the diversity between two
molecules based on Tanimoto coefficient for binary
fingerprint comparison
Ref. Dimitris K. Agrafiotis & Victor S. Lobanov, J.
Chem. Inf. Comput. Sci. 1999, 39, 51-58
•Optimize
reagent use
Automated
library synthesis:
·
Chemistry
definition (reaction transform – SMIRKS
– and building block lists)
·
Library
enumeration (GA)
·
Run
preparation (material balance)
·
Chemistry
services (bulk to automation ready)
·
Automated
synthesis
·
Plate
reformatting
·
Purification
and LC/MS analysis
·
High
throughput screening
Summary:
• Design,
synthesis, and screening cycle is ~1 week
• Current
throughput is several thousand compounds per week
• >50,000
compounds have been synthesized and screened using more than 40 automated
combinatorial reactions
• Hundreds of
active molecules have been identified from a total of over 10 chemical classes