Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

485 Episodes

Context Engineering as a Discipline: Building Governed AI Analytics - E484

Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick Schrock, CTO and founder of Dagster Labs, to discuss Compass - a Slack-native, agentic analytics system designed to keep data teams connected with business stakeholders. Nick shares his journey from initial skepticism to embracing agentic AI as model and…

Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick…

11 October 2025 | 00:51:58


The Data Model That Captures Your Business: Metric Trees Explained - E483

Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach to data modeling that directly captures a company's business model. Vijay shares insights from his decade-long experience building data practices at Rent the Runway and explains how the modern data stack has…

Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace,…

05 October 2025 | 01:01:05


From GPUs-as-a-Service to Workloads-as-a-Service: Flex AI’s Path to High-Utilization AI Infra - E482

Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about revolutionizing AI engineering by removing DevOps burdens through "workload as a service". Brijesh shares his expertise from leading AI/HPC architecture at Intel and deploying supercomputers like Aurora, highlighting…

Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews…

28 September 2025 | 00:56:31


From RAG to Relational: How Agentic Patterns Are Reshaping Data Architecture - E481

Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are transforming database usage and infrastructure design. He discusses the evolving role of data in AI systems, from traditional models to more modern approaches like vectors, RAG, and relational databases.…

Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at…

18 September 2025 | 00:52:58


Duck Lake: Simplifying the Lakehouse Ecosystem - E480

Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the creators of DuckDB, share their work on Duck Lake, a new entrant in the open lakehouse ecosystem. They discuss how Duck Lake, is focused on simplicity, flexibility, and offers a unified catalog and table format compared to other lakehouse formats like…

Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the…

10 September 2025 | 01:10:41


Aligning Business and Data: The Essential Role of Data Modeling - E479

Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL DBM, talks about the socio-technical aspects of data modeling. Serge shares his background in data modeling and highlights its importance as a collaborative process between business stakeholders and data teams. He debunks common misconceptions that…

Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL…

01 September 2025 | 01:06:51


From Academia to Industry: Bridging Data Engineering Challenges - E478

Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on knowledge graphs and data engineering. Paul shares his background in AI and data management, discussing the evolution of data provenance and lineage, as well as the challenges of data integration. He explores…

Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of…

26 August 2025 | 00:50:54


High Performance And Low Overhead Graphs With KuzuDB - E477

Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Prashanth explains how KuzuDB addresses performance shortcomings in existing solutions through columnar storage and novel join algorithms. He discusses the usability and scalability of KuzuDB, emphasizing its…

Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB,…

18 August 2025 | 01:01:29


Bridging Data and Decision-Making: AI's Role in Modern Analytics - E476

Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an autonomous data analyst that bridges the gap between data availability and business decision-making. Lucas and Drew share their backgrounds in data analytics and how their experiences have shaped their…

Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity…

12 August 2025 | 01:10:44


From Bits to Tables: The Evolution of S3 Storage - E475

Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and their integration into modern data stacks. Andy shares his journey through the tech industry and his role at Amazon, where he collaborates to enhance storage capabilities, discussing the evolution of S3 from…

Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative…

05 August 2025 | 00:50:08