Command Palette

Search for a command to run...

PodMine
a16z Podcast
a16z Podcast•September 25, 2025

From Vibe Coding to Vibe Researching: OpenAI’s Mark Chen and Jakub Pachocki

OpenAI's Mark Chen and Jakub Pachocki discuss their research journey towards creating an automated researcher, exploring the future of AI reasoning, and the challenges of advancing machine learning capabilities across various scientific domains.
AI & Machine Learning
Tech Policy & Ethics
Developer Culture
Cryptocurrency
Sam Altman
Jakob Ushkoreit
Mark Chen
OpenAI

Summary Sections

  • Podcast Summary
  • Speakers
  • Key Takeaways
  • Statistics & Facts
  • Compelling StoriesPremium
  • Thought-Provoking QuotesPremium
  • Strategies & FrameworksPremium
  • Similar StrategiesPlus
  • Additional ContextPremium
  • Key Takeaways TablePlus
  • Critical AnalysisPlus
  • Books & Articles MentionedPlus
  • Products, Tools & Software MentionedPlus
0:00/0:00

Timestamps are as accurate as they can be but may be slightly off. We encourage you to listen to the full context.

0:00/0:00

Podcast Summary

OpenAI's Chief Scientist Jakob Ushkoreit and Chief Research Officer Mark Chen discuss the groundbreaking GPT-5 launch and OpenAI's ambitious goal of creating an automated researcher. (00:22) The conversation explores how GPT-5 represents a major step toward bringing reasoning capabilities into the mainstream, combining their previous instant-response GPT series with the deeper thinking capabilities of their O-series models. (01:52) The discussion delves into OpenAI's research roadmap focused on extending AI reasoning horizons from current 1-5 hour problem-solving capabilities to much longer timeframes needed for genuine scientific discovery. (07:04)

  • Main Theme: The pursuit of automated research capabilities through advanced reasoning models, with emphasis on extending AI's ability to operate autonomously over increasingly longer time horizons while maintaining quality and discovering genuinely new ideas.

Speakers

Jakob Ushkoreit - Chief Scientist, OpenAI

Jakob serves as OpenAI's Chief Scientist and has been instrumental in developing the company's reasoning capabilities. He has a strong background in competitive programming and has been at OpenAI for nearly a decade, helping guide fundamental research directions from GPT-2 through GPT-5.

Mark Chen - Chief Research Officer, OpenAI

Mark is OpenAI's Chief Research Officer who started as a resident at OpenAI and worked his way up through the organization. He has exceptional talent for both deep technical research and team leadership, playing a crucial role in building and managing research teams that tackle ambitious AI challenges.

Key Takeaways

Shift Focus to Economically Relevant Evaluations

Traditional AI benchmarks are becoming saturated, with models achieving 96-98% performance on many existing evaluations. (03:18) OpenAI is transitioning toward evaluations that measure actual economic impact and real-world discovery capabilities. This represents a fundamental shift from academic benchmarks to measuring whether AI can contribute to genuine scientific and technological progress. The key insight is that as models approach human-level performance on constrained problems, the next frontier involves open-ended research tasks that create new knowledge rather than just demonstrating existing capabilities.

Extended Reasoning Horizons Enable Breakthrough Capabilities

The path to automated research requires extending AI reasoning from current 1-5 hour problem-solving windows to much longer timeframes. (07:05) This involves developing models that can maintain coherent planning and memory over extended periods while making autonomous progress on complex problems. The breakthrough comes from combining reasoning depth with temporal persistence, allowing AI to tackle problems that require sustained effort over days, weeks, or months - similar to how human researchers approach complex scientific challenges.

Reinforcement Learning's Versatility Drives Continuous Progress

RL continues to exceed expectations because it provides a versatile framework for exploring numerous training approaches once anchored to natural language understanding. (11:58) The key breakthrough was solving the "environment problem" through language modeling - giving RL a rich, robust environment to operate within. This combination allows researchers to execute diverse training objectives and continuously discover new directions for improvement, explaining why RL gains haven't plateaued despite repeated predictions of diminishing returns.

Protect Fundamental Research While Balancing Product Needs

Successful AI organizations must create clear boundaries between fundamental research and product development to maintain innovation velocity. (30:14) This requires giving researchers space to focus on long-term breakthroughs without constant product pressures, while also having dedicated teams that bridge research advances into practical applications. The strategy involves protecting researchers from being pulled in multiple product directions while ensuring alignment on the ultimate vision of automated research capabilities.

Persistence and Honest Self-Assessment Define Great Researchers

Exceptional researchers combine deep conviction in important problems with ruthless honesty about experimental results. (20:37) The key is maintaining persistence through inevitable failures while staying truthful about progress and learning from setbacks. Great researchers target widely recognized but seemingly intractable problems, constantly questioning why current approaches fail and what barriers prevent the next breakthrough. This mindset enables sustained effort on multi-year challenges while avoiding the trap of trying to prove ideas work rather than genuinely testing them.

Statistics & Facts

  1. GPT-5 achieved number two ranking in the AtCoder programming competition, with only the top spot remaining. (05:17) This demonstrates near-mastery of high-level competitive programming challenges that traditionally required years of training for human experts.
  2. Current reasoning models can operate effectively over 1-5 hour problem-solving horizons. (07:25) This timeframe represents the current frontier for sustained AI reasoning before quality degrades or models lose coherence.
  3. OpenAI has been targeting automated researcher capabilities as their primary research goal for "a couple years now." (33:46) This timeline indicates the sustained focus and strategic commitment behind their reasoning model development.

Compelling Stories

Available with a Premium subscription

Thought-Provoking Quotes

Available with a Premium subscription

Strategies & Frameworks

Available with a Premium subscription

Similar Strategies

Available with a Plus subscription

Additional Context

Available with a Premium subscription

Key Takeaways Table

Available with a Plus subscription

Critical Analysis

Available with a Plus subscription

Books & Articles Mentioned

Available with a Plus subscription

Products, Tools & Software Mentioned

Available with a Plus subscription

More episodes like this

In Good Company with Nicolai Tangen
January 14, 2026

Figma CEO: From Idea to IPO, Design at Scale and AI’s Impact on Creativity

In Good Company with Nicolai Tangen
We Study Billionaires - The Investor’s Podcast Network
January 14, 2026

BTC257: Bitcoin Mastermind Q1 2026 w/ Jeff Ross, Joe Carlasare, and American HODL (Bitcoin Podcast)

We Study Billionaires - The Investor’s Podcast Network
Uncensored CMO
January 14, 2026

Rory Sutherland on why luck beats logic in marketing

Uncensored CMO
This Week in Startups
January 13, 2026

How to Make Billions from Exposing Fraud | E2234

This Week in Startups
Swipe to navigate