Command Palette

Search for a command to run...

PodMine
Latent Space: The AI Engineer Podcast
Latent Space: The AI Engineer Podcast•December 30, 2025

[Latent Space LIVE @ NeurIPS] State of AI Startups 2025 — with Sarah Catanzaro, Amplify Partners

Live from NeurIPS 2025, Sarah Catanzaro from Amplify Partners discusses the state of AI startups, covering topics like the DBT-Fivetran merger, the crazy funding environment, world models, and her investment thesis focused on research-driven applications solving hard technical problems like RAG, rule-following, and continual learning.
AI & Machine Learning
Indie Hackers & SaaS Builders
Tech Policy & Ethics
Developer Culture
Data Science & Analytics
B2B SaaS Business
Sarah Carranzaro
Bar

Summary Sections

  • Podcast Summary
  • Speakers
  • Key Takeaways
  • Statistics & Facts
  • Compelling StoriesPremium
  • Thought-Provoking QuotesPremium
  • Strategies & FrameworksPremium
  • Similar StrategiesPlus
  • Additional ContextPremium
  • Key Takeaways TablePlus
  • Critical AnalysisPlus
  • Books & Articles MentionedPlus
  • Products, Tools & Software MentionedPlus
0:00/0:00

Timestamps are as accurate as they can be but may be slightly off. We encourage you to listen to the full context.

0:00/0:00

Podcast Summary

Sarah Catanzaro from Amplify Partners offers a veteran data investor's perspective on AI's evolution and the intersection between data infrastructure and artificial intelligence. (01:02) She discusses the DBT-Fivetran merger as a strategic IPO preparation move rather than the death of the modern data stack, revealing how major AI labs are adopting these same data tools for training data management and agent analytics. The conversation explores the concerning $100M+ seed funding phenomenon where companies raise massive rounds without clear six-month roadmaps, the overhyped but underspecified world models category, and her thesis that personalization through memory management and continual learning will be the key to solving AI application retention and churn problems in 2026.

• Main Theme: The symbiotic relationship between data infrastructure and AI, with a critical examination of current funding trends and emerging opportunities in personalization and memory management for AI applications.

Speakers

Sarah Catanzaro

Sarah is a partner at Amplify Partners where she focuses on AI infrastructure and applications after previously investing through the modern data stack era with companies like DBT. She started her career in symbolic AI systems before transitioning to data infrastructure, driven by a desire to understand what happens when SQL queries are executed. Sarah has been at the intersection of data, compute, and intelligence for years, watching categories emerge, merge, and evolve from the analytics explosion to today's AI frontier.

Key Takeaways

The Modern Data Stack Isn't Dead—It's IPO-Ready

The DBT-Fivetran merger signals strategic consolidation for IPO preparation rather than category failure. (01:02) Both companies were beating revenue targets and growing healthily, but needed to reach the new IPO threshold of $600M+ combined revenue. Major AI labs are actually heavy users of both DBT and Fivetran for training data curation and agent analytics, proving the tools remain relevant in the AI era. The merger represents the natural evolution of category winners positioning for liquidity rather than a retreat from market demand.

AI Labs Have Sophisticated Data Infrastructure Needs

Frontier AI companies are paying careful attention to their data stacks, from data discoverability to efficient GPU data loading. (08:42) Sarah notes that training datasets need management, user interactions with agents require complex analytics, and GPU idle time from inefficient data loading creates significant cost implications. Surprisingly, much existing data infrastructure has scaled elegantly to meet AI use cases, though new challenges around ad hoc workloads and transactional database requirements (like OpenAI using RocksDB) are emerging.

$100M+ Seed Rounds Signal Dangerous Market Dynamics

The current funding environment features companies raising massive seed rounds ($100M+) at billion-dollar valuations without clear six-month roadmaps. (10:13) Founders are optimizing for signal and prestige rather than partnership or dilution discipline, creating seven-day decision windows that prevent proper due diligence. This dynamic makes it impossible to assess whether teams can execute on their long-term vision, as investors lack time to build conviction about founders' capabilities while founders focus on transactional relationships rather than strategic partnerships.

Personalization Through Memory and Continual Learning Is the 2026 Unlock

AI application companies suffer from low retention and high churn because products lack meaningful personalization. (19:01) The solution lies in memory management and continual learning systems that don't just store facts but learn new skills from user interactions and adapt as the world changes. This represents the consumerization of AI—moving beyond magical but static experiences to products that become more valuable over time through personalization, similar to how consumer enterprise tools disrupted traditional software a decade ago.

RL Environments Are Overhyped—Real-World Data Is Superior

Despite labs paying 7-8 figures for synthetic RL environments, the best environment is the real world itself. (23:37) Real user logs, traces, and activity data (like Cursor uses) are richer, cheaper, and more generalizable than expensive synthetic clones. While some aspects of environment design remain relevant (rubrics, task definition), building app clones represents misallocated resources when authentic user behavior data provides superior training signals for improving AI systems.

Statistics & Facts

  1. The new IPO revenue threshold is $600M+ combined annual revenue, significantly higher than the previous $100M benchmark that Sarah mentions as insufficient for today's public market environment. (01:53)
  2. AI labs are paying 7-8 figures for RL environments when real-world logs and traces would provide superior training data at lower cost. (24:17)
  3. The DBT-Fivetran combined entity will be "close to 600" million in revenue, positioning them for IPO readiness in the current market environment. (01:53)

Compelling Stories

Available with a Premium subscription

Thought-Provoking Quotes

Available with a Premium subscription

Strategies & Frameworks

Available with a Premium subscription

Similar Strategies

Available with a Plus subscription

Additional Context

Available with a Premium subscription

Key Takeaways Table

Available with a Plus subscription

Critical Analysis

Available with a Plus subscription

Books & Articles Mentioned

Available with a Plus subscription

Products, Tools & Software Mentioned

Available with a Plus subscription

More episodes like this

In Good Company with Nicolai Tangen
January 14, 2026

Figma CEO: From Idea to IPO, Design at Scale and AI’s Impact on Creativity

In Good Company with Nicolai Tangen
We Study Billionaires - The Investor’s Podcast Network
January 14, 2026

BTC257: Bitcoin Mastermind Q1 2026 w/ Jeff Ross, Joe Carlasare, and American HODL (Bitcoin Podcast)

We Study Billionaires - The Investor’s Podcast Network
Uncensored CMO
January 14, 2026

Rory Sutherland on why luck beats logic in marketing

Uncensored CMO
This Week in Startups
January 13, 2026

How to Make Billions from Exposing Fraud | E2234

This Week in Startups
Swipe to navigate