Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.



478 Episodes

High Performance And Low Overhead Graphs With KuzuDB - E477

Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Prashanth explains how KuzuDB addresses performance shortcomings in existing solutions through columnar storage and novel join algorithms. He discusses the usability and scalability of KuzuDB, emphasizing its…

18 August 2025 | 01:01:29


Bridging Data and Decision-Making: AI's Role in Modern Analytics - E476

Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an autonomous data analyst that bridges the gap between data availability and business decision-making. Lucas and Drew share their backgrounds in data analytics and how their experiences have shaped their…

12 August 2025 | 01:10:44


From Bits to Tables: The Evolution of S3 Storage - E475

Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and their integration into modern data stacks. Andy shares his journey through the tech industry and his role at Amazon, where he collaborates to enhance storage capabilities, discussing the evolution of S3 from…

05 August 2025 | 00:50:08


Revolutionizing Python Notebooks with Marimo - E474

Summary In this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, which offers a reactive execution model, full Python integration, and built-in UI elements to enhance the interactive computing experience. He discusses the challenges of traditional Jupyter notebooks, such as…

28 July 2025 | 00:51:56


Warehouse Native Incremental Data Processing With Dynamic Tables And Delayed View Semantics - E473

Summary In this episode of the Data Engineering Podcast Dan Sotolongo from Snowflake talks about the complexities of incremental data processing in warehouse environments. Dan discusses the challenges of handling continuously evolving datasets and the importance of incremental data processing for optimized resource use and reduced latency. He…

21 July 2025 | 00:55:07


Streamlining Data Pipelines with MCP Servers and Vector Engines - E472

Summary In this episode of the Data Engineering Podcast Kacper Łukawski from Qdrant talks about integrating MCP servers with vector databases to process unstructured data. Kacper shares his experience in data engineering, from building big data pipelines in the automotive industry to leveraging large language models (LLMs) for transforming unstructured…

15 July 2025 | 00:52:04


Foundational Data Engineering At Two Sigma - E471

Summary In this episode of the Data Engineering Podcast Effie Baram, a leader in foundational data engineering at Two Sigma, talks about the complexities and innovations in data engineering within the finance sector. She discusses the critical role of data at Two Sigma, balancing data quality with delivery speed, and the socio-technical challenges…

06 July 2025 | 00:55:05


Enabling Agents In The Enterprise With A Platform Approach - E470

Summary In this episode of the Data Engineering Podcast Arun Joseph talks about developing and implementing agent platforms to empower businesses with agentic capabilities. From leading AI engineering at Deutsche Telekom to his current entrepreneurial venture focused on multi-agent systems, Arun shares insights on building agentic systems at an…

29 June 2025 | 00:54:18


Dagster's New Era: Modularizing Data Transformation in the Age of AI - E469

Summary In this episode of the Data Engineering Podcast we welcome back Nick Schrock, CTO and founder of Dagster Labs, to discuss the evolving landscape of data engineering in the age of AI. As AI begins to impact data platforms and the role of data engineers, Nick shares his insights on how it will ultimately enhance productivity and expand…

18 June 2025 | 01:01:37


AI and the Lakehouse: How Starburst is Pioneering New Workflows - E468

Summary In this episode of the Data Engineering Podcast Alex Albu, tech lead for AI initiatives at Starburst, talks about integrating AI workloads with the lakehouse architecture. From his software engineering roots to leading data engineering efforts, Alex shares insights on enhancing Starburst's platform to support AI applications, including an…

11 June 2025 | 00:44:09