Search for a command to run...

Timestamps are as accurate as they can be but may be slightly off. We encourage you to listen to the full context.
OpenAI's Chief Scientist Jakob Ushkoreit and Chief Research Officer Mark Chen discuss the groundbreaking GPT-5 launch and OpenAI's ambitious goal of creating an automated researcher. (00:22) The conversation explores how GPT-5 represents a major step toward bringing reasoning capabilities into the mainstream, combining their previous instant-response GPT series with the deeper thinking capabilities of their O-series models. (01:52) The discussion delves into OpenAI's research roadmap focused on extending AI reasoning horizons from current 1-5 hour problem-solving capabilities to much longer timeframes needed for genuine scientific discovery. (07:04)
Jakob serves as OpenAI's Chief Scientist and has been instrumental in developing the company's reasoning capabilities. He has a strong background in competitive programming and has been at OpenAI for nearly a decade, helping guide fundamental research directions from GPT-2 through GPT-5.
Mark is OpenAI's Chief Research Officer who started as a resident at OpenAI and worked his way up through the organization. He has exceptional talent for both deep technical research and team leadership, playing a crucial role in building and managing research teams that tackle ambitious AI challenges.
Traditional AI benchmarks are becoming saturated, with models achieving 96-98% performance on many existing evaluations. (03:18) OpenAI is transitioning toward evaluations that measure actual economic impact and real-world discovery capabilities. This represents a fundamental shift from academic benchmarks to measuring whether AI can contribute to genuine scientific and technological progress. The key insight is that as models approach human-level performance on constrained problems, the next frontier involves open-ended research tasks that create new knowledge rather than just demonstrating existing capabilities.
The path to automated research requires extending AI reasoning from current 1-5 hour problem-solving windows to much longer timeframes. (07:05) This involves developing models that can maintain coherent planning and memory over extended periods while making autonomous progress on complex problems. The breakthrough comes from combining reasoning depth with temporal persistence, allowing AI to tackle problems that require sustained effort over days, weeks, or months - similar to how human researchers approach complex scientific challenges.
RL continues to exceed expectations because it provides a versatile framework for exploring numerous training approaches once anchored to natural language understanding. (11:58) The key breakthrough was solving the "environment problem" through language modeling - giving RL a rich, robust environment to operate within. This combination allows researchers to execute diverse training objectives and continuously discover new directions for improvement, explaining why RL gains haven't plateaued despite repeated predictions of diminishing returns.
Successful AI organizations must create clear boundaries between fundamental research and product development to maintain innovation velocity. (30:14) This requires giving researchers space to focus on long-term breakthroughs without constant product pressures, while also having dedicated teams that bridge research advances into practical applications. The strategy involves protecting researchers from being pulled in multiple product directions while ensuring alignment on the ultimate vision of automated research capabilities.
Exceptional researchers combine deep conviction in important problems with ruthless honesty about experimental results. (20:37) The key is maintaining persistence through inevitable failures while staying truthful about progress and learning from setbacks. Great researchers target widely recognized but seemingly intractable problems, constantly questioning why current approaches fail and what barriers prevent the next breakthrough. This mindset enables sustained effort on multi-year challenges while avoiding the trap of trying to prove ideas work rather than genuinely testing them.