Making a hash of it: The advantage of selectively leaving out structural information
Firstly, this is an overview of a general technique for solving a wide range of cheminformatics problems; namely, the calculation and comparison of hashes of molecules to find duplicates with respect to some structural property of interest.
It also introduces MolHash to the world, a command-line application from NextMove Software which is freely available from GitHub (*).
* If you don't fancy compiling it yourself, you may find Matt Swain's conda version useful.
No comments:
Post a Comment