Charting Chemical Space with Computers: Challenges and Opportunities for AI and Machine Learning

Pierre Baldi

University of California, Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, Irvine, USA

Informatics and computers have not yet become as pervasive in chemistry as they have in physics and biology. By analogy with bioinformatics, the key ingredients for progress in chemoinformatics are the availability of large, annotated databases of compounds and reactions; data structures and algorithms to search these databases efficiently; and computational methods to predict the physical, chemical, and biological properties of new compounds and reactions. We will describe: (1) the development of a large public database of compounds and reactions (ChemDB); (2) machine learning kernel methods to predict molecular properties; and (3) the application of these methods to drug screening and design problems and to the identification of new drug leads against a major disease.
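
As a rough illustration of the kind of similarity measure that both database search and kernel methods on molecules commonly build on, the sketch below computes Tanimoto similarity between molecular fingerprints and ranks a toy database against a query. The abstract does not say which kernel or fingerprint representation is used, so the choice of Tanimoto similarity, the representation of a fingerprint as a set of integer feature indices, the function names, and the toy data are all assumptions made for illustration only.

```python
def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity between two fingerprints.

    Each fingerprint is assumed to be a set of integer indices of the
    features that are present in the molecule (a hypothetical encoding).
    """
    a, b = set(a), set(b)
    union = len(a | b)
    if union == 0:
        # Convention chosen here: two empty fingerprints count as identical.
        return 1.0
    return len(a & b) / union


def rank_by_similarity(query, database):
    """Return (name, score) pairs sorted by decreasing similarity to query."""
    scores = [(name, tanimoto(query, fp)) for name, fp in database.items()]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy, made-up fingerprints; these are not real compounds or real ChemDB data.
toy_db = {
    "mol_a": {1, 4, 7, 9},
    "mol_b": {1, 4, 8},
    "mol_c": {2, 3},
}

ranked = rank_by_similarity({1, 4, 7}, toy_db)
for name, score in ranked:
    print(f"{name}: {score:.2f}")
```

The same pairwise function could, under these assumptions, fill a similarity matrix that is handed to a kernel-based learner for property prediction, which is why a single measure can serve both the search and the prediction tasks named above.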
