Data Engineering Podcast
Episode Archive
422 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Moving Machine Learning Into The Data Pipeline at Cherre
April 19th, 2021 | 48 mins 4 secs
An interview about how the team at Cherre built an internal machine learning project to use as a service in their data pipelines to make dealing with messy address data less painful.
-
Exploring The Expanding Landscape Of Data Professions with Josh Benamram of Databand
April 12th, 2021 | 1 hr 8 mins
An interview with Josh Benamram about the emerging roles across the data ecosystem and how they interact with data systems.
-
Put Your Whole Data Team On The Same Page With Atlan
April 5th, 2021 | 57 mins 36 secs
In this episode Prukalpa Sankar discusses how Atlan uses metadata from all of your workflows to bring everyone onto the same page, letting you deliver on your data projects in record time.
-
Data Quality Management For The Whole Team With Soda Data
March 29th, 2021 | 58 mins
An interview about the Soda Data platform and the open source components that they are building to level up the quality of your data pipelines.
-
Real World Change Data Capture At Datacoral
March 22nd, 2021 | 49 mins 58 secs
An interview with Raghu Murthy about the reality of running and maintaining change data capture pipelines for customers at Datacoral.
-
Managing The DoorDash Data Platform
March 15th, 2021 | 46 mins 4 secs
An interview with Sudhir Tonse about his work at DoorDash to help build a data platform that scales to meet the self service needs of analysts and data scientists.
-
Leave Your Data Where It Is And Automate Feature Extraction With Molecula
March 8th, 2021 | 51 mins 39 secs
An interview with H.O. Maycotte about how the Molecula platform and the underlying Pilosa engine allow you to automatically extract features from your data without having to centralize it.
-
Bridging The Gap Between Machine Learning And Operations At Iguazio
March 1st, 2021 | 1 hr 6 mins
An interview about how the Iguazio platform reduces the friction in bringing your machine learning workloads to production in a fast and maintainable way.
-
Self Service Open Source Data Integration With Airbyte
February 22nd, 2021 | 52 mins 15 secs
An interview about the open source data integration platform Airbyte and how you can use it to provide self service access to your data consumers.
-
Building The Foundations For Data Driven Businesses at 5xData
February 15th, 2021 | 52 mins 15 secs
An interview with Tarush Aggarwal about his work at 5xData to help more companies build the foundations they need to become truly data driven.
-
How Shopify Is Building Their Production Data Warehouse Using DBT
February 8th, 2021 | 46 mins 30 secs
An interview with Shopify's engineers about how they are using DBT to build a data warehouse platform that scales to meet the needs of the business.
-
System Observability For The Cloud Native Era With Chronosphere
February 1st, 2021 | 1 hr 4 mins
An interview about the Chronosphere platform and the M3DB storage engine for managing system metrics to power observability in the cloud native era.
-
Making It Easier To Stick B2B Data Integration Pipelines Together With Hotglue
January 25th, 2021 | 34 mins 5 secs
An interview with the founders of the Hotglue platform about how they are helping B2B application developers build data integration pipelines for working with customer information.
-
Using Your Data Warehouse As The Source Of Truth For Customer Data With Hightouch
January 18th, 2021 | 59 mins 33 secs
An episode about the Hightouch platform and how it allows you to maintain a single source of truth for all of your customer data in your data warehouse and keep all of your downstream systems accurate and up to date.
-
Enabling Version Controlled Data Collaboration With TerminusDB
January 11th, 2021 | 57 mins 48 secs
An interview about the TerminusDB platform and how it supports data collaboration through a version controlled graph storage engine.
-
Bringing Feature Stores and MLOps to the Enterprise at Tecton
January 4th, 2021 | 47 mins 40 secs
An interview with Kevin Stumpf, CTO of Tecton, about his work building an enterprise grade feature store and how it functions as the core element of an MLOps strategy.