This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
Support the show!Listen in your favorite app:
FountainHere are shows you might like
Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick Schrock, CTO and founder of Dagster Labs, to discuss Compass - a Slack-native, agentic analytics system designed to keep data teams connected with business stakeholders. Nick shares his journey from initial skepticism to embracing agentic AI as model and…
Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick…
11 October 2025 | 00:51:58
Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach to data modeling that directly captures a company's business model. Vijay shares insights from his decade-long experience building data practices at Rent the Runway and explains how the modern data stack has…
Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace,…
05 October 2025 | 01:01:05
Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about revolutionizing AI engineering by removing DevOps burdens through "workload as a service". Brijesh shares his expertise from leading AI/HPC architecture at Intel and deploying supercomputers like Aurora, highlighting…
Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews…
28 September 2025 | 00:56:31
Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are transforming database usage and infrastructure design. He discusses the evolving role of data in AI systems, from traditional models to more modern approaches like vectors, RAG, and relational databases.…
Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at…
18 September 2025 | 00:52:58
Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the creators of DuckDB, share their work on Duck Lake, a new entrant in the open lakehouse ecosystem. They discuss how Duck Lake, is focused on simplicity, flexibility, and offers a unified catalog and table format compared to other lakehouse formats like…
Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the…
10 September 2025 | 01:10:41
Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL DBM, talks about the socio-technical aspects of data modeling. Serge shares his background in data modeling and highlights its importance as a collaborative process between business stakeholders and data teams. He debunks common misconceptions that…
Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL…
01 September 2025 | 01:06:51
Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on knowledge graphs and data engineering. Paul shares his background in AI and data management, discussing the evolution of data provenance and lineage, as well as the challenges of data integration. He explores…
Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of…
26 August 2025 | 00:50:54
Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Prashanth explains how KuzuDB addresses performance shortcomings in existing solutions through columnar storage and novel join algorithms. He discusses the usability and scalability of KuzuDB, emphasizing its…
Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB,…
18 August 2025 | 01:01:29
Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an autonomous data analyst that bridges the gap between data availability and business decision-making. Lucas and Drew share their backgrounds in data analytics and how their experiences have shaped their…
Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity…
12 August 2025 | 01:10:44
Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and their integration into modern data stacks. Andy shares his journey through the tech industry and his role at Amazon, where he collaborates to enhance storage capabilities, discussing the evolution of S3 from…
Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative…
05 August 2025 | 00:50:08