Molecular similarity is a tricky concept, mostly because there are many ways to define and measure similarity. For example, two molecules could be considered similar because they have the same biological effect, or because they have identical molecular weight, or because they contain the same functional groups, etc., etc. A natural follow-on question from this is “what is the correct way to measure molecular similarity?” and the answer, unfortunately, is that it depends.
As an example of these complexities, Greg Landrum has a great blog post on how Tanimoto similarity changes depending on how you vectorise a molecule, and the need for authors to clarify the vectorisation method used. Variation in Tanimoto similarities is also something Ísak has written about on blopig.
Continue reading



