Data Engineering Podcast
Episode Archive
423 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Scale Your Spatial Analysis By Building It In SQL With Syntax Extensions
February 6th, 2022 | 59 mins 54 secs
An interview with Matthew Forrest about using SQL to build your spatial analysis workflows so that they are more maintainable and uniform.
-
Scalable Strategies For Protecting Data Privacy In Your Shared Data Sets
February 6th, 2022 | 1 hr 6 secs
An interview with Privacy Dynamics lead engineer Will Thompson about useful strategies for managing data privacy in your shared data sets.
-
A Reflection On Learning A Lot More Than 97 Things Every Data Engineer Should Know
January 30th, 2022 | 41 mins 35 secs
An exploration of the macroscopic and microscopic themes and details that are useful for new and experienced data engineers to know in order to grow their careers.
-
Effective Pandas Patterns For Data Engineering
January 30th, 2022 | 1 hr 21 secs
An interview with Matt Harrison about how to write effective pandas code for scalable and maintainable data processing logic that can be understood by other members of your team.
-
The Importance Of Data Contracts As The Interface For Data Integration With Abhi Sivasailam
January 23rd, 2022 | 56 mins
An interview with Abhi Sivasailam about his work at Flexport to design and implement a data mesh solution that relies heavily on data contracts to provide a stable interface that teams can implement for integrating analytical workflows across the organization.
-
Building And Managing Data Teams And Data Platforms In Large Organizations With Ashish Mrig
January 23rd, 2022 | 52 mins 44 secs
An interview with Ashish Mrig about his career in data engineering, his experiences managing data teams at Wayfair, and the technical considerations that factor into platform design decisions in large organizations.
-
Automated Data Quality Management Through Machine Learning With Anomalo
January 15th, 2022 | 1 hr 2 mins
An interview with the founders of Anomalo about how they are using statistical machine learning systems to automate the detection and diagnosis of data quality issues that occur in your data warehouse.
-
An Introduction To Data And Analytics Engineering For Non-Programmers
January 15th, 2022 | 50 mins 13 secs
An interview with Brian McMillan about his work on the book "Building Data Products" and how he is bringing data professionals and business users into alignment for creating the systems that are necessary for organizations to succeed in the modern era.
-
Open Source Reverse ETL For Everyone With Grouparoo
January 7th, 2022 | 44 mins 56 secs
An interview with Brian Leonard about the open source reverse ETL framework Grouparoo and how you can start using it today.
-
Data Observability Out Of The Box With Metaplane
January 7th, 2022 | 50 mins 47 secs
An interview with Kevin Hu about his work on Metaplane to make implementing data observability practices as low friction as possible for data teams and organizations.
-
Creating Shared Context For Your Data Warehouse With A Controlled Vocabulary
January 1st, 2022 | 1 hr 34 secs
An interview with Emily Riederer about the work of establishing a controlled vocabulary for building a shared context in your data warehouse to reduce communication overhead.
-
A Reflection On The Data Ecosystem For The Year 2021
January 1st, 2022 | 1 hr 3 mins
A wide-ranging conversation among a panel of data professionals about their views on the past year's trends in the data management and analytics ecosystem and what we might expect for the year to come.
-
Exploring The Evolving Role Of Data Engineers
December 26th, 2021 | 57 mins 41 secs
An interview with Maxime Beauchemin about how the technological progression in the data ecosystem is driving a constant change in the role and responsibilities of data engineers.
-
Revisiting The Technical And Social Benefits Of The Data Mesh
December 26th, 2021 | 1 hr 10 mins
An interview with Zhamak Dehghani about her experience working with the community that has grown up around her idea of the data mesh and the lessons that she has learned.
-
Fast And Flexible Headless Data Analytics With Cube.JS
December 21st, 2021 | 54 mins 43 secs
An interview with the creators of Cube.JS about their work to build an open source framework for performant OLAP queries delivered through web and SQL APIs.
-
Building A System Of Record For Your Organization's Data Ecosystem At Metaphor
December 19th, 2021 | 1 hr 5 mins
An interview with the founders of Metaphor about their work to build a system of record for all of the data in your organization that bridges the technical and social requirements of your teams.