This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
Support the show!Listen in your favorite app:
FountainHere are shows you might like
Summary In this episode Robert Nishihara, co-founder of Anyscale and co-creator of Ray, talks about maximizing hardware utilization for AI and data-intensive workloads. He explores Ray’s evolution alongside Kubernetes and PyTorch, and why consolidation at these layers has enabled a new generation of complex, heterogeneous workloads. Robert explains…
Summary In this episode Robert Nishihara, co-founder of Anyscale and co-creator of Ray, talks about…
06 May 2026 | 00:58:34
Summary In this episode, I sit down with Gleb Mezhanskiy, CEO and co-founder of Datafold, to explore how agentic AI is reshaping data engineering. We unpack the leap from chat-assisted coding to truly agentic workflows where AI not only writes SQL and dbt models but also executes queries, debugs, runs tests, and ships production-ready outcomes.…
Summary In this episode, I sit down with Gleb Mezhanskiy, CEO and co-founder of Datafold, to…
07 April 2026 | 00:59:24
Summary In this episode Himant Goyal, Senior Product Manager at Salesforce, talks about how data platform investments enable reliable, accurate metering for consumption-based business models. Himant explains why consumption turns operations into a real-time optimization problem spanning metering, cost attribution, billing, governance, and…
Summary In this episode Himant Goyal, Senior Product Manager at Salesforce, talks about how data…
29 March 2026 | 00:50:19
Summary In this episode Rowan Cockett, co-founder and CEO of CurveNote and co-founder of the Continuous Science Foundation, talks about building data systems that make scientific research reproducible, reusable, and easier to communicate. He digs into the sociotechnical roots of the reproducibility crisis - from data integrity and access to…
Summary In this episode Rowan Cockett, co-founder and CEO of CurveNote and co-founder of the…
22 March 2026 | 00:42:40
Summary In this episode Raj Shukla, CTO of SymphonyAI, explores what it really takes to build self‑improving AI systems that work in production. Raj unpacks how agentic systems interact with real-world environments, the feedback loops that enable continuous learning, and why intelligent memory layers often provide the most practical middle ground…
Summary In this episode Raj Shukla, CTO of SymphonyAI, explores what it really takes to build…
16 March 2026 | 01:01:50
Summary In this episode of the Data Engineering Podcast, Lucas Thelosen and Drew Gilson, co-founders of Gravity, discuss their vision for agentic analytics in the enterprise, enabled by semantic layers and broader context engineering. They share their journey from Looker and Google to building Orion, an AI analyst that combines data semantics with…
Summary In this episode of the Data Engineering Podcast, Lucas Thelosen and Drew Gilson,…
08 March 2026 | 01:05:01
Summary In this episode of the Data Engineering Podcast, Jamie Knowles (Product Director) and Ryan Hirsch (Product Marketing Manager) discuss the importance of enterprise data modeling with ER/Studio. They highlight how clear, shared semantic models are a foundational discipline for modern data engineering, preventing semantic drift, speeding up…
Summary In this episode of the Data Engineering Podcast, Jamie Knowles (Product Director) and Ryan…
02 March 2026 | 00:45:02
Summary In this episode of the Data Engineering Podcast, Vasilije "Vas" Markovich, founder of Cognee, discusses building agentic memory, a crucial aspect of artificial intelligence that enables systems to learn, adapt, and retain knowledge over time. He explains the concept of agentic memory, highlighting the importance of distinguishing between…
Summary In this episode of the Data Engineering Podcast, Vasilije "Vas" Markovich, founder of…
22 February 2026 | 00:57:47
Summary In this episode of the Data Engineering Podcast, Aman Agarwal, creator of OpenLit, discusses the operational groundwork required to run LLM-powered applications reliably and cost-effectively. He highlights common blind spots that teams face, including opaque model behavior, runaway token costs, and brittle prompt management, and explains…
Summary In this episode of the Data Engineering Podcast, Aman Agarwal, creator of OpenLit,…
15 February 2026 | 00:50:43
Summary In this episode, Shilpa Kolhar, SVP of Product and Engineering at MongoDB, discusses using MongoDB as a unified foundation for AI-driven and agentic applications. She explains how the Application Modernization Platform (AMP) accelerates the transition from legacy relational systems to a document-first architecture, driven by the need for…
Summary In this episode, Shilpa Kolhar, SVP of Product and Engineering at MongoDB, discusses using…
08 February 2026 | 00:46:45