Data Engineering Podcast
Episode Archive
Episode Archive
419 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Change Data Capture For All Of Your Databases With Debezium
January 5th, 2020 | 53 mins 1 sec
An interview about how the Debezium framework simplifies implementing change data capture for all of your database engines
-
Building The DataDog Platform For Processing Timeseries Data At Massive Scale
December 30th, 2019 | 45 mins 54 secs
An interview with a DataDog engineer about how they build reliable and highly available systems for processing timeseries data in real time and at massive scale
-
Building The Materialize Engine For Interactive Streaming Analytics In SQL
December 22nd, 2019 | 48 mins 7 secs
An episode about building Materialize for interactive analytics on continuously updated streams of data
-
Solving Data Lineage Tracking And Data Discovery At WeWork
December 16th, 2019 | 1 hr 1 min
An interview about how the Marquez platform for metadata management powers data lineage tracking, data discovery, and health reporting at WeWork
-
SnowflakeDB: The Data Warehouse Built For The Cloud
December 8th, 2019 | 58 mins 56 secs
An interview about how SnowflakeDB was built to provide a performant and flexible data platform for the cloud era
-
Organizing And Empowering Data Engineers At Citadel
December 2nd, 2019 | 45 mins 50 secs
An interview about building a successful data team and managing their career growth to power a successful financial business
-
Building A Real Time Event Data Warehouse For Sentry
November 26th, 2019 | 1 hr 1 min
An interview about how Sentry used Clickhouse to build an event data warehouse and pay down their architecture debt
-
Escaping Analysis Paralysis For Your Data Platform With Data Virtualization
November 18th, 2019 | 55 mins 42 secs
An interview about data virtualization and data engineering automation with AtScale and the value of abstractions for your data platform architecture
-
Designing For Data Protection
November 11th, 2019 | 51 mins 23 secs
An interview about data protection regulations and how they can influence the design of your data platform
-
Automating Your Production Dataflows On Spark
November 4th, 2019 | 48 mins 50 secs
An interview about how the Ascend platform provides an autonomous data orchestration platform to simplify your production dataflows
-
Build Maintainable And Testable Data Applications With Dagster
October 28th, 2019 | 1 hr 7 mins
An interview about the Dagster framework and how you can use it to build testable and maintainable data applications
-
Data Orchestration For Hybrid Cloud Analytics
October 21st, 2019 | 42 mins 51 secs
An interview about the emerging category of data orchestration platforms and how they can be used to bridge the gap between modern and legacy analytics systems
-
Keeping Your Data Warehouse In Order With DataForm
October 14th, 2019 | 47 mins 4 secs
An interview about Dataform and how it helps you to keep your data warehouse in good working order
-
Fast Analytics On Semi-Structured And Structured Data In The Cloud
October 7th, 2019 | 54 mins 38 secs
An interview about the architecture of Rockset and how they built a serverless platform for fast and flexible analytics on your semi-structured data
-
Ship Faster With An Opinionated Data Pipeline Framework
September 30th, 2019 | 35 mins 8 secs
An interview about how the open source Kedro framework makes it faster and easier to build your end-to-end data pipeline for machine learning projects
-
Open Source Object Storage For All Of Your Data
September 22nd, 2019 | 1 hr 8 mins
An interview on the open source MinIO platform for fast and flexible object storage for data intensive applications and analytics that runs everywhere