Search for a command to run...

Timestamps are as accurate as they can be but may be slightly off. We encourage you to listen to the full context.
This episode features Shai Shen Orr, Co-Founder and Chief Scientist at CytoReason and Professor at the Technion, discussing how his company is bridging the critical "data-insight gap" in drug development. (04:00) Orr explains how biology has evolved from a "one tube, one result" field to a "one tube, a million results" field, creating massive analytical challenges where data grows exponentially but insights remain linear. (05:08) CytoReason, founded in 2016, serves major pharmaceutical companies like Pfizer and Sanofi by providing AI-powered disease models that help researchers make data-driven decisions throughout the drug development lifecycle. The conversation explores how agentic AI workflows are revolutionizing literature curation, data processing, and decision-making in an industry where a single drug costs $2.5 billion to develop and faces a 90% failure rate.
Shai Shen Orr is Co-Founder and Chief Scientist at CytoReason and Professor of Systems Immunology and Precision Medicine at the Technion Israel Institute of Technology. (01:31) He has been working in computational biology and data science since the late 1990s, transitioning from AI work to life sciences when he realized the profound humanitarian potential of applying machine learning to healthcare challenges. As a systems immunologist, Orr recognized early that biology was generating more data than could be manually analyzed, leading him to co-found CytoReason in 2016 to build scalable AI solutions for pharmaceutical companies.
Orr emphasizes that in rapidly evolving fields like biology, professionals must "run just to stay in place" due to exponential data growth. (12:40) He implements a company-wide principle where employees spend 80% of their time on current responsibilities and 20% automating their own jobs. This approach prevents teams from being overwhelmed by the data avalanche while continuously advancing capabilities. The key insight is recognizing that manual processes become unsustainable when data grows exponentially - in immunology, a new paper is published every two minutes. (10:42) Professionals must proactively identify which aspects of their work can be automated before they become bottlenecks.
In fields where there are "way more features than samples" - what Orr calls "deep data" - traditional machine learning approaches fail due to overfitting. (15:38) The solution involves integrating prior knowledge from literature and established research to narrow down search spaces and provide confidence in predictions. This approach serves dual purposes: it makes discoveries more feasible and gives stakeholders the confidence needed to make expensive decisions. Rather than relying solely on black-box predictions, professionals should build mechanistic models that explain why predictions make sense, connecting novel discoveries to established knowledge.
CytoReason's approach combines deep learning, traditional statistics, and rule-based systems rather than relying on a single AI methodology. (28:19) Orr explains that in biology, there are few areas where you can simply input data into a deep learning model and achieve good performance - most domains lack sufficient training data. The solution is creating integrated frameworks that call different services tailored to specific components. This hybrid approach acknowledges that different types of problems require different AI solutions, and the most effective systems intelligently orchestrate multiple approaches rather than forcing everything through one model type.
When AI recommendations influence expensive experiments or life-affecting treatments, confidence scoring becomes critical. (16:57) CytoReason uses multiple techniques including retrieval-augmented generation, literature sampling, and biological credibility checks to generate confidence scores for their predictions. The goal is balancing the necessity of AI acceleration with the high quality required for pharmaceutical decision-making. Professionals should implement feedback loops that continuously improve confidence assessments rather than treating AI outputs as definitive answers.
While AI has made significant advances in chemistry and clinical operations, Orr identifies biology as the largest unsolved challenge in pharmaceutical development. (23:42) Phase 2 clinical trials, where drugs are first tested in humans, show the highest failure rates because of poor understanding of disease biology and human diversity. The search space in biology is vastly larger than in chemistry, making it both the most challenging and potentially most impactful area for AI advancement. Professionals should recognize that tackling the hardest problems often offers the greatest opportunities for breakthrough impact.