Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Listen in your favorite app:

Pick your app with Episodes.fm

More options

Amazon Music

Show RSS Feed

Click to copy to clipboard

Here are shows you might like

See show recommendations

AI Engineering Podcast
Tobias Macey

The Python Podcast.__init__
Tobias Macey

Streamlining Data Pipelines with MCP Servers and Vector Engines - E472

Summary In this episode of the Data Engineering Podcast Kacper Łukawski from Qdrant about integrating MCP servers with vector databases to process unstructured data. Kacper shares his experience in data engineering, from building big data pipelines in the automotive industry to leveraging large language models (LLMs) for transforming unstructured…

Summary In this episode of the Data Engineering Podcast Kacper Łukawski from Qdrant about…

15 July 2025 | 00:52:04

Foundational Data Engineering At Two Sigma - E471

Summary In this episode of the Data Engineering Podcast Effie Baram, a leader in foundational data engineering at Two Sigma, talks about the complexities and innovations in data engineering within the finance sector. She discusses the critical role of data at Two Sigma, balancing data quality with delivery speed, and the socio-technical challenges…

Summary In this episode of the Data Engineering Podcast Effie Baram, a leader in foundational data…

06 July 2025 | 00:55:05

Enabling Agents In The Enterprise With A Platform Approach - E470

Summary In this episode of the Data Engineering Podcast Arun Joseph talks about developing and implementing agent platforms to empower businesses with agentic capabilities. From leading AI engineering at Deutsche Telekom to his current entrepreneurial venture focused on multi-agent systems, Arun shares insights on building agentic systems at an…

Summary In this episode of the Data Engineering Podcast Arun Joseph talks about developing and…

29 June 2025 | 00:54:18

Dagster's New Era: Modularizing Data Transformation in the Age of AI - E469

Summary In this episode of the Data Engineering Podcast we welcome back Nick Schrock, CTO and founder of Dagster Labs, to discuss the evolving landscape of data engineering in the age of AI. As AI begins to impact data platforms and the role of data engineers, Nick shares his insights on how it will ultimately enhance productivity and expand…

Summary In this episode of the Data Engineering Podcast we welcome back Nick Schrock, CTO and…

18 June 2025 | 01:01:37

AI and the Lakehouse: How Starburst is Pioneering New Workflows - E468

Summary In this episode of the Data Engineering Podcast Alex Albu, tech lead for AI initiatives at Starburst, talks about integrating AI workloads with the lakehouse architecture. From his software engineering roots to leading data engineering efforts, Alex shares insights on enhancing Starburst's platform to support AI applications, including an…

Summary In this episode of the Data Engineering Podcast Alex Albu, tech lead for AI initiatives at…

11 June 2025 | 00:44:09

Amazon S3: The Backbone of Modern Data Systems - E467

Summary In this episode of the Data Engineering Podcast Mai-Lan Tomsen Bukovec, Vice President of Technology at AWS, talks about the evolution of Amazon S3 and its profound impact on data architecture. From her work on compute systems to leading the development and operations of S3, Mylan shares insights on how S3 has become a foundational element…

Summary In this episode of the Data Engineering Podcast Mai-Lan Tomsen Bukovec, Vice President of…

03 June 2025 | 01:01:01

Scaling Data Operations With Platform Engineering - E466

Summary In this episode of the Data Engineering Podcast Chakravarthy Kotaru talks about scaling data operations through standardized platform offerings. From his roots as an Oracle developer to leading the data platform at a major online travel company, Chakravarthy shares insights on managing diverse database technologies and providing databases…

Summary In this episode of the Data Engineering Podcast Chakravarthy Kotaru talks about scaling data…

29 May 2025 | 00:42:20

From Data Discovery to AI: The Evolution of Semantic Layers - E465

Summary In this episode of the Data Engineering Podcast, host Tobias Macy welcomes back Shinji Kim to discuss the evolving role of semantic layers in the era of AI. As they explore the challenges of managing vast data ecosystems and providing context to data users, they delve into the significance of semantic layers for AI applications. They dive…

Summary In this episode of the Data Engineering Podcast, host Tobias Macy welcomes back Shinji Kim…

21 May 2025 | 00:49:30

Balancing Off-the-Shelf and Custom Solutions in Data Engineering - E464

Summary In this episode of the Data Engineering Podcast Tulika Bhatt, a senior software engineer at Netflix, talks about her experiences with large-scale data processing and the future of data engineering technologies. Tulika shares her journey into the data engineering field, discussing her work at BlackRock and Verizon before joining Netflix, and…

Summary In this episode of the Data Engineering Podcast Tulika Bhatt, a senior software engineer at…

13 May 2025 | 00:46:05

StarRocks: Bridging Lakehouse and OLAP for High-Performance Analytics - E463

Summary In this episode of the Data Engineering Podcast Sida Shen, product manager at CelerData, talks about StarRocks, a high-performance analytical database. Sida discusses the inception of StarRocks, which was forked from Apache Doris in 2020 and evolved into a high-performance Lakehouse query engine. He explains the architectural design of…

Summary In this episode of the Data Engineering Podcast Sida Shen, product manager at CelerData,…

05 May 2025 | 00:59:41