Search for a command to run...

Timestamps are as accurate as they can be but may be slightly off. We encourage you to listen to the full context.
Professor Jim Collins from MIT has developed an AI-powered approach that successfully creates novel antibiotics using machine learning techniques on remarkably small datasets. His team trains convolutional graph neural networks on just a few thousand chemical compounds, then screens tens of millions of potential antibiotics in silico to identify promising candidates. (27:00) The process has yielded several breakthrough antibiotic discoveries, including Halicin - a powerful new antibiotic that works against drug-resistant strains through entirely new mechanisms of action. (26:31)
Termeer Professor of Medical Engineering at MIT and leader of groundbreaking AI-powered antibiotic discovery research. Collins has been working on antibiotics for over twenty years and has used machine learning in this context from the very beginning to reverse engineer biomolecular networks inside bacteria. He runs labs at both MIT and the Broad Institute, where he developed the revolutionary AI approach that discovered Halicin and other novel antibiotics.
Collins' team achieved remarkable success with just 2,500 compounds in their initial training dataset - orders of magnitude smaller than typical AI applications. (27:27) While AI colleagues initially dismissed the approach as having "far too little data," the team achieved a 51-52% true positive rate compared to less than 1% for random screening. The key insight is that having quality positive examples of antibiotics in the training set, even with binary classification, provides sufficient signal for the model to learn meaningful chemical structures associated with antibacterial activity.
The choice of convolutional graph neural networks proved crucial for learning chemical structures bond by bond and substructure by substructure. (27:19) These models can process molecular structures as graphs, making them naturally suited for chemical applications. Collins notes that while they've explored large language models for chemistry, the graph neural networks still outperform them, likely because they're specifically designed to understand molecular relationships and chemical bonds.
The AI prediction is just the first step in a comprehensive pipeline that scores candidates on multiple criteria: antibacterial activity, novelty compared to existing antibiotics, non-toxicity to human cells, chemical stability, and synthesizability. (58:54) This multi-objective approach helps narrow down from millions of predicted compounds to hundreds worth synthesizing, dramatically improving the hit rate and reducing costly synthesis failures.
Despite AI's power, medicinal chemists still outperform models in certain tasks. Collins describes how a postdoc outperformed their synthesizability model when evaluating 30,000 compounds, using an approach focused on identifying molecular "liabilities." (61:01) The key insight was that the human expert looked for "bad, bad, bad" features and only approved molecules without obvious liabilities - a different approach than how the AI model was trained.
Some AI-discovered antibiotics show remarkable resistance to developing bacterial resistance. Halicin maintained effectiveness over 30 days while conventional antibiotics like Cipro showed significant resistance within days. (76:08) This likely occurs because AI-identified compounds target multiple molecular targets simultaneously, requiring bacteria to develop mutations across multiple sites - a much more difficult evolutionary challenge than single-target resistance.