Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

459 Episodes

The Importance Of Data Contracts As The Interface For Data Integration With Abhi Sivasailam - E258

Summary

Data platforms are exemplified by a complex set of connections that are subject to a set of constantly evolving requirements. In order to make this a tractable problem it is necessary to define boundaries for communication between concerns, which brings with it the need to establish…

Summary

Data platforms are exemplified by a…

23 January 2022 | 00:56:00


Building And Managing Data Teams And Data Platforms In Large Organizations With Ashish Mrig - E257

Summary

Data engineering is a relatively young and rapidly expanding field, with practitioners having a wide array of experiences as they navigate their careers. Ashish Mrig currently leads the data analytics platform for Wayfair, as well as running a local data engineering meetup. In this episode…

Summary

Data engineering is a relatively young…

23 January 2022 | 00:52:45


Automated Data Quality Management Through Machine Learning With Anomalo - E256

Summary

Data quality control is a requirement for being able to trust the various reports and machine learning models that are relying on the information that you curate. Rules based systems are useful for validating known requirements, but with the scale and complexity of data in modern…

Summary

Data quality control is a requirement for…

15 January 2022 | 01:02:30


An Introduction To Data And Analytics Engineering For Non-Programmers - E255

Summary

Applications of data have grown well beyond the venerable business intelligence dashboards that organizations have relied on for decades. Now it is being used to power consumer facing services, influence organizational behaviors, and build sophisticated machine learning systems. Given this…

Summary

Applications of data have grown well…

15 January 2022 | 00:50:14


Open Source Reverse ETL For Everyone With Grouparoo - E254

Summary

Reverse ETL is a product category that evolved from the landscape of customer data platforms with a number of companies offering their own implementation of it. While struggling with the work of automating data integration workflows with marketing, sales, and support tools Brian Leonard…

Summary

Reverse ETL is a product category that…

08 January 2022 | 00:44:57


Data Observability Out Of The Box With Metaplane - E253

Summary

Data observability is a set of technical and organizational capabilities related to understanding how your data is being processed and used so that you can proactively identify and fix errors in your workflows. In this episode Metaplane founder Kevin Hu shares his working definition of the…

Summary

Data observability is a set of technical…

08 January 2022 | 00:50:48


Creating Shared Context For Your Data Warehouse With A Controlled Vocabulary - E252

Summary

Communication and shared context are the hardest part of any data system. In recent years the focus has been on data catalogs as the means for documenting data assets, but those introduce a secondary system of record in order to find the necessary information. In this episode Emily Riederer…

Summary

Communication and shared context are the…

02 January 2022 | 01:00:35


A Reflection On The Data Ecosystem For The Year 2021 - E251

Summary

This has been an active year for the data ecosystem, with a number of new product categories and substantial growth in existing areas. In an attempt to capture the zeitgeist Maura Church, David Wallace, Benn Stancil, and Gleb Mezhanskiy join the show to reflect on the past year and share…

Summary

This has been an active year for the data…

02 January 2022 | 01:03:29


Exploring The Evolving Role Of Data Engineers - E249

Summary

Data Engineering is still a relatively new field that is going through a continued evolution as new technologies are introduced and new requirements are understood. In this episode Maxime Beauchemin returns to revisit what it means to be a data engineer and how the role has changed over the…

Summary

Data Engineering is still a relatively…

27 December 2021 | 00:57:42


Revisiting The Technical And Social Benefits Of The Data Mesh - E250

Summary

The data mesh is a thesis that was presented to address the technical and organizational challenges that businesses face in managing their analytical workflows at scale. Zhamak Dehghani introduced the concepts behind this architectural patterns in 2019, and since then it has been gaining…

Summary

The data mesh is a thesis that was…

27 December 2021 | 01:10:53