Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

468 Episodes

Amazon S3: The Backbone of Modern Data Systems - E467

Summary In this episode of the Data Engineering Podcast Mai-Lan Tomsen Bukovec, Vice President of Technology at AWS, talks about the evolution of Amazon S3 and its profound impact on data architecture. From her work on compute systems to leading the development and operations of S3, Mylan shares insights on how S3 has become a foundational element…

Summary In this episode of the Data Engineering Podcast Mai-Lan Tomsen Bukovec, Vice President of…

03 June 2025 | 01:01:01


Scaling Data Operations With Platform Engineering - E466

Summary In this episode of the Data Engineering Podcast Chakravarthy Kotaru talks about scaling data operations through standardized platform offerings. From his roots as an Oracle developer to leading the data platform at a major online travel company, Chakravarthy shares insights on managing diverse database technologies and providing databases…

Summary In this episode of the Data Engineering Podcast Chakravarthy Kotaru talks about scaling data…

29 May 2025 | 00:42:20


From Data Discovery to AI: The Evolution of Semantic Layers - E465

Summary In this episode of the Data Engineering Podcast, host Tobias Macy welcomes back Shinji Kim to discuss the evolving role of semantic layers in the era of AI. As they explore the challenges of managing vast data ecosystems and providing context to data users, they delve into the significance of semantic layers for AI applications. They dive…

Summary In this episode of the Data Engineering Podcast, host Tobias Macy welcomes back Shinji Kim…

21 May 2025 | 00:49:30


Balancing Off-the-Shelf and Custom Solutions in Data Engineering - E464

Summary In this episode of the Data Engineering Podcast Tulika Bhatt, a senior software engineer at Netflix, talks about her experiences with large-scale data processing and the future of data engineering technologies. Tulika shares her journey into the data engineering field, discussing her work at BlackRock and Verizon before joining Netflix, and…

Summary In this episode of the Data Engineering Podcast Tulika Bhatt, a senior software engineer at…

13 May 2025 | 00:46:05


StarRocks: Bridging Lakehouse and OLAP for High-Performance Analytics - E463

Summary In this episode of the Data Engineering Podcast Sida Shen, product manager at CelerData, talks about StarRocks, a high-performance analytical database. Sida discusses the inception of StarRocks, which was forked from Apache Doris in 2020 and evolved into a high-performance Lakehouse query engine. He explains the architectural design of…

Summary In this episode of the Data Engineering Podcast Sida Shen, product manager at CelerData,…

05 May 2025 | 00:59:41


Exploring NATS: A Multi-Paradigm Connectivity Layer for Distributed Applications - E462

Summary In this episode of the Data Engineering Podcast Derek Collison, creator of NATS and CEO of Synadia, talks about the evolution and capabilities of NATS as a multi-paradigm connectivity layer for distributed applications. Derek discusses the challenges and solutions in building distributed systems, and highlights the unique features of NATS…

Summary In this episode of the Data Engineering Podcast Derek Collison, creator of NATS and CEO of…

28 April 2025 | 01:12:50


Advanced Lakehouse Management With The LakeKeeper Iceberg REST Catalog - E461

Summary In this episode of the Data Engineering Podcast Viktor Kessler, co-founder of Vakmo, talks about the architectural patterns in the lake house enabled by a fast and feature-rich Iceberg catalog. Viktor shares his journey from data warehouses to developing the open-source project, Lakekeeper, an Apache Iceberg REST catalog written in Rust…

Summary In this episode of the Data Engineering Podcast Viktor Kessler, co-founder of Vakmo, talks…

21 April 2025 | 00:57:13


Simplifying Data Pipelines with Durable Execution - E460

Summary In this episode of the Data Engineering Podcast Jeremy Edberg, CEO of DBOS, about durable execution and its impact on designing and implementing business logic for data systems. Jeremy explains how DBOS's serverless platform and orchestrator provide local resilience and reduce operational overhead, ensuring exactly-once execution in…

Summary In this episode of the Data Engineering Podcast Jeremy Edberg, CEO of DBOS, about durable…

12 April 2025 | 00:39:49


Overcoming Redis Limitations: The Dragonfly DB Approach - E459

Summary In this episode of the Data Engineering Podcast Roman Gershman, CTO and founder of Dragonfly DB, explores the development and impact of high-speed in-memory databases. Roman shares his experience creating a more efficient alternative to Redis, focusing on performance gains, scalability, and cost efficiency, while addressing limitations such…

Summary In this episode of the Data Engineering Podcast Roman Gershman, CTO and founder of Dragonfly…

30 March 2025 | 00:43:58


Bringing AI Into The Inner Loop of Data Engineering With Ascend - E458

Summary In this episode of the Data Engineering Podcast Sean Knapp, CEO of Ascend.io, explores the intersection of AI and data engineering. He discusses the evolution of data engineering and the role of AI in automating processes, alleviating burdens on data engineers, and enabling them to focus on complex tasks and innovation. The conversation…

Summary In this episode of the Data Engineering Podcast Sean Knapp, CEO of Ascend.io, explores the…

24 March 2025 | 00:52:47