sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

Server stats:

586
active users

#rdkit

0 posts0 participants0 posts today

🎙️ Join Franciszek Job at EuroSciPy as he presents a scalable framework to unify chemical datasets from sources like PubChem, UniChem & COCONUT.

💻 Canonicalize with RDKit
⚡ Scale via Dask
🔁 Deduplicate with InChI keys

Ideal for ML pretraining, benchmarking, and chemical data analysis.

📅 Schedule: lnkd.in/eaAxwUN2
🎟️ Tickets: lnkd.in/end9aYzE

lnkd.inLinkedInThis link will take you to a page that’s not on LinkedIn

Just announced "talus - tools to make SMARTS-based #cheminformatics fingerprints" to the #RDKit mailing list.

hg.sr.ht/~dalke/talus

I'm able to process SMILES file -> Klekota-Roth SMARTS-based fingerprints at about 12.5x baseline speed.

RDMol -> fingerprints performance is 13.2x baseline.

It includes some SMARTS analysis methods to tell if a SMARTS atom pattern X is a subset of a SMARTS atom pattern Y.