Search for a command to run...

Timestamps are as accurate as they can be but may be slightly off. We encourage you to listen to the full context.
Stack Overflow's Ryan Donovan sits down with Dhruv Batra, cofounder and chief scientist at Yatori, to explore the fascinating world of proactive AI agents. Unlike traditional reactive agents that wait for prompts, Batra's team is building agents that continuously monitor the web and take initiative on behalf of users. With nearly 20 years in AI research spanning computer vision, robotics, and natural language processing, Batra shares his journey from Georgia Tech professor to Meta's FAIR division leader. (10:11)
Host of the Stack Overflow podcast and blog editor at Stack Overflow. Donovan guides technical discussions and explores emerging technology trends with industry leaders.
Cofounder and Chief Scientist at Yatori with nearly 20 years of AI research experience. Former Senior Director at Meta's FAIR (Fundamental AI Research) division leading Embodied AI initiatives. Previously a professor at Georgia Tech where he created their deep learning curriculum, and his teams developed breakthrough technologies including image question-answering models for Ray-Ban Meta sunglasses and the world's fastest 3D simulator called Habitat for training virtual robots.
Batra argues that the AI community is engaging in "opportunistic redefinition" of AGI, narrowing its scope to exclude physical intelligence, robotics, and tactile sensing. (07:07) True AGI, he contends, should encompass the original 1950s vision of intelligent agents that can interact with both digital and physical worlds. This perspective helps professionals maintain realistic expectations while recognizing the significant progress in natural language interfaces. By understanding these limitations, leaders can make more informed decisions about AI implementation and avoid overestimating current capabilities.
The current web economy built on human attention and advertisements will undergo fundamental restructuring as AI agents become primary web traffic. (16:06) Batra envisions a fairer value exchange where agents represent humans with high purchase intent, creating new economic incentives beyond the attention economy. Professional service providers should prepare for this shift by considering how to monetize agent interactions and create value propositions that work for both human-directed agents and traditional users.
Unlike traditional short-lived AI interactions, proactive agents operate continuously over weeks or months, creating sophisticated narrative tracking capabilities. (31:08) Batra's example of a scout that tracked Meta's acquisition activities for ten weeks, evolving its understanding and expanding its scope autonomously, demonstrates the power of persistent intelligence. This represents a paradigm shift from one-time problem solving to continuous intelligence gathering and analysis, offering professionals unprecedented monitoring and analysis capabilities.
Rather than building specific scrapers for narrow use cases, Yatori takes an "intelligence first" approach that can handle any web-based task a human could perform. (20:20) This general approach means agents can adapt to changing websites, understand context, and optimize monitoring frequency based on the type of information being tracked. Professionals should consider this flexibility when implementing AI solutions, focusing on general intelligence rather than brittle, specific automation that breaks when conditions change.
Yatori's product roadmap demonstrates a strategic approach to building user trust by starting with read-only monitoring before progressing to write actions like booking reservations. (19:11) This graduated approach acknowledges that different mistakes have different costs and allows users to develop confidence in agent capabilities incrementally. Organizations implementing AI agents should adopt similar risk-graduated approaches, beginning with low-stakes tasks and progressively expanding agent authority as reliability is demonstrated.