We benchmark on the community-standard Dalke NN dataset (1,000 high-similarity ChEMBL pairs) — the same dataset widely used by RDKit, CDK, and the academic MCS literature. Identical SMILES input, same ...
The Fast Vector Similarity Library is a high-performance Rust-based tool for efficiently computing similarity measures between vectors. It is ideal for data analysis, machine learning, and statistical ...