Data Engineering Podcast
Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry
About the show
This show goes behind the scenes of the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
Episodes
-
The Benefits And Challenges Of Building A Data Trust
February 3rd, 2020 | 56 mins 52 secs
An interview about the BrightHive platform for building data trusts and the complexities that are inherent in sharing data across organizations
-
Pay Down Technical Debt In Your Data Pipeline With Great Expectations
January 26th, 2020 | 46 mins 31 secs
An interview about how the Great Expectations framework helps you add meaningful tests and validation to your data pipeline to drive down technical debt
-
Replatforming Production Dataflows
January 20th, 2020 | 39 mins
An interview about how Mayvenn replatformed their production dataflows using Ascend and improved their ability to deliver meaningful analytics to their business
-
Planet Scale SQL For The New Generation Of Applications With YugabyteDB
January 13th, 2020 | 1 hr 1 min
An interview about YugabyteDB and how it was architected to power the new generation of planet scale applications
-
Change Data Capture For All Of Your Databases With Debezium
January 5th, 2020 | 53 mins 1 sec
An interview about how the Debezium framework simplifies implementing change data capture for all of your database engines
-
Building The DataDog Platform For Processing Timeseries Data At Massive Scale
December 30th, 2019 | 45 mins 54 secs
An interview with a DataDog engineer about how they build reliable and highly available systems for processing timeseries data in real time and at massive scale
-
Building The Materialize Engine For Interactive Streaming Analytics In SQL
December 22nd, 2019 | 48 mins 7 secs
An episode about building Materialize for interactive analytics on continuously updated streams of data
-
Solving Data Lineage Tracking And Data Discovery At WeWork
December 16th, 2019 | 1 hr 1 min
An interview about how the Marquez platform for metadata management powers data lineage tracking, data discovery, and health reporting at WeWork
-
SnowflakeDB: The Data Warehouse Built For The Cloud
December 8th, 2019 | 58 mins 56 secs
An interview about how SnowflakeDB was built to provide a performant and flexible data platform for the cloud era
-
Organizing And Empowering Data Engineers At Citadel
December 2nd, 2019 | 45 mins 50 secs
An interview about building a successful data team and managing its members' career growth to power a successful financial business
-
Building A Real Time Event Data Warehouse For Sentry
November 26th, 2019 | 1 hr 1 min
An interview about how Sentry used ClickHouse to build an event data warehouse and pay down their architecture debt
-
Escaping Analysis Paralysis For Your Data Platform With Data Virtualization
November 18th, 2019 | 55 mins 42 secs
An interview about data virtualization and data engineering automation with AtScale and the value of abstractions for your data platform architecture
-
Designing For Data Protection
November 11th, 2019 | 51 mins 23 secs
An interview about data protection regulations and how they can influence the design of your data platform
-
Automating Your Production Dataflows On Spark
November 4th, 2019 | 48 mins 50 secs
An interview about how Ascend provides an autonomous data orchestration platform to simplify your production dataflows
-
Build Maintainable And Testable Data Applications With Dagster
October 28th, 2019 | 1 hr 7 mins
An interview about the Dagster framework and how you can use it to build testable and maintainable data applications
-
Data Orchestration For Hybrid Cloud Analytics
October 21st, 2019 | 42 mins 51 secs
An interview about the emerging category of data orchestration platforms and how they can be used to bridge the gap between modern and legacy analytics systems