Join Franciszek Job at EuroSciPy as he presents a scalable framework to unify chemical datasets from sources like PubChem, UniChem & COCONUT.
Canonicalize with RDKit
Scale via Dask
Deduplicate with InChI keys
Ideal for ML pretraining, benchmarking, and chemical data analysis.
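To make the deduplication step concrete, here is a minimal sketch of merging records from several sources by InChIKey. It is not the framework from the talk. It assumes each record already carries an InChIKey (in practice this could be computed upstream, for example with RDKit's `Chem.MolToInchiKey`), and it skips the Dask scaling entirely. The record fields, the `merge_by_inchikey` helper, and the COCONUT-style ID are hypothetical and only for illustration.

```python
from collections import defaultdict

# Illustrative records only. The field names and the COCONUT-style ID are
# hypothetical; InChIKeys are assumed to be precomputed upstream.
records = [
    {"source": "PubChem", "id": "2244",
     "inchikey": "BSYNRYMUTXBXSQ-UHFFFAOYSA-N"},  # aspirin
    {"source": "COCONUT", "id": "CNP-EXAMPLE-1",
     "inchikey": "RYYVLZVUVIJVGH-UHFFFAOYSA-N"},  # caffeine
    {"source": "PubChem", "id": "2519",
     "inchikey": "RYYVLZVUVIJVGH-UHFFFAOYSA-N"},  # caffeine again
]


def merge_by_inchikey(records):
    """Group records by InChIKey, keeping the first ID seen per source."""
    merged = defaultdict(dict)
    for rec in records:
        # Records sharing an InChIKey are treated as the same compound.
        merged[rec["inchikey"]].setdefault(rec["source"], rec["id"])
    return dict(merged)


unified = merge_by_inchikey(records)
for key, ids in unified.items():
    print(key, ids)
```

Each InChIKey ends up as one entry listing the source IDs that map to it, so the two caffeine records collapse into a single compound.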
Schedule: https://lnkd.in/eaAxwUN2
Tickets: https://lnkd.in/end9aYzE