About
Episodes
Host
Contact
Search
Advertise
Be Our Guest
The Machine Learning Podcast
Subscribe

Data Engineering Podcast

Episode Archive

RSS
Apple Podcasts
Amazon Music
Google Podcasts
Spotify
Stitcher
TuneIn

Episode Archive

422 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.

Moving Machine Learning Into The Data Pipeline at Cherre

April 19th, 2021 | 48 mins 4 secs

An interview about how the team at Cherre built an internal machine learning project to use as a service in their data pipelines to make dealing with messy address data less painful.
Exploring The Expanding Landscape Of Data Professions with Josh Benamram of Databand

April 12th, 2021 | 1 hr 8 mins

An interview with Josh Benamram about the emerging roles across the data ecosystem and how they interact with data systems.
Put Your Whole Data Team On The Same Page With Atlan

April 5th, 2021 | 57 mins 36 secs

In this episode Prukalpa Sankar discusses how Atlan uses metadata from all of your workflows to bring everyone on the same page, letting you delivery on your data projects in record time.
Data Quality Management For The Whole Team With Soda Data

March 29th, 2021 | 58 mins

An interview about the Soda Data platform and the open source components that they are building to level up the quality of your data pipelines.
Real World Change Data Capture At Datacoral

March 22nd, 2021 | 49 mins 58 secs

An interview with Raghu Murthy about the reality of running and maintaining change data capture pipelines for customers at Datacoral.
Managing The DoorDash Data Platform

March 15th, 2021 | 46 mins 4 secs

An interview with Sudhir Tonse about his work at DoorDash to help build a data platform that scales to meet the self service needs of analysts and data scientists.
Leave Your Data Where It Is And Automate Feature Extraction With Molecula

March 8th, 2021 | 51 mins 39 secs

An interview with H.O. Maycotte about how the Molecula platform and the underlying Pilosa engine allows you to automatically do feature extraction from your data without having to centralize it.
Bridging The Gap Between Machine Learning And Operations At Iguazio

March 1st, 2021 | 1 hr 6 mins

An interview about how the Iguazio platform reduces the friction in bringing your machine learning workloads to production in a fast and maintainable way.
Self Service Open Source Data Integration With AirByte

February 22nd, 2021 | 52 mins 15 secs

An interview about the open source data integration platform Airbyte and how you can use it to provide self service access to your data consumers.
Building The Foundations For Data Driven Businesses at 5xData

February 15th, 2021 | 52 mins 15 secs

An interview with Tarush Aggarwal about his work at 5xData to help more companies build the foundations they need to become truly data driven.
How Shopify Is Building Their Production Data Warehouse Using DBT

February 8th, 2021 | 46 mins 30 secs

An interview with Shopify's engineers about how they are using DBT to build a data warehouse platform that scales to meet the needs of the business.
System Observability For The Cloud Native Era With Chronosphere

February 1st, 2021 | 1 hr 4 mins

An interview about the Chronosphere platform and the M3DB storage engine for managing system metrics to power observability in the cloud native era.
Making It Easier To Stick B2B Data Integration Pipelines Together With Hotglue

January 25th, 2021 | 34 mins 5 secs

An interview with the founders of the Hotglue platform about how they are helping B2B application developers build data integration pipelines for working with customer information.
Using Your Data Warehouse As The Source Of Truth For Customer Data With Hightouch

January 18th, 2021 | 59 mins 33 secs

An episode about the Hightouch platform and how it allows you to maintain a single source of truth for all of your customer data in your data warehouse and keep all of your downstream systems accurate and up to date.
Enabling Version Controlled Data Collaboration With TerminusDB

January 11th, 2021 | 57 mins 48 secs

An interview about the TerminusDB platform and how it supports data collaboration through a version controlled graph storage engine.
Bringing Feature Stores and MLOps to the Enterprise at Tecton

January 4th, 2021 | 47 mins 40 secs

An interview with Kevin Stumpf, CTO of Tecton, about his work building an enterprise grade feature store and how it functions as the core element of an MLOps strategy.

← Previous
1
2
…
15
16
17
…
26
27
Next →

Data Engineering Podcast is © 2024 by Boundless Notions, LLC.

About
Episodes
Host
Contact
Search
Advertise
Be Our Guest
The Machine Learning Podcast
Subscribe