About
Episodes
Host
Contact
Search
Advertise
Be Our Guest
The Machine Learning Podcast
Subscribe

Data Engineering Podcast

Episode Archive

RSS
Apple Podcasts
Amazon Music
Google Podcasts
Spotify
Stitcher
TuneIn

Episode Archive

423 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.

Building Auditable Spark Pipelines At Capital One

December 12th, 2021 | 42 mins 9 secs

An interview with Capital One engineer Gokul Prabagaren about his work on building Spark workflows with a data enrichment approach to provide auditable data transformations.
Deliver Personal Experiences In Your Applications With The Unomi Open Source Customer Data Platform

December 11th, 2021 | 57 mins 33 secs

An interview about the open source Unomi framework for building a customer data platform and how it can provide a personalized experience to your audience.
Data Driven Hiring For Data Professionals With Alooba

December 4th, 2021 | 50 mins 2 secs

An interview with Alooba founder Tim Freestone about the challenges of interviewing data professionals and how he is working to provide a more detailed view of candidates abilities through high quality skills assessments.
Experimentation and A/B Testing For Modern Data Teams With Eppo

December 4th, 2021 | 58 mins

An interview with Eppo founder Chetan Sharma about the challenges of designing, running, and analyzing product experiments and the work that he is doing to make it more accessible to organizations of every size.
Creating A Unified Experience For The Modern Data Stack At Mozart Data

November 27th, 2021 | 58 mins 31 secs

An interview with Peter Fishman and Dan Silberman about how they are working to reduce the effort involved in setting up and integrating the various components of the modern data stack at Mozart Data.
Doing DataOps For External Data Sources As A Service at Demyst

November 27th, 2021 | 59 mins 16 secs

An interview with Demyst founder Mark Hookey about the use cases for external data sources and how they have built a DataOps platform to provide third party data sets as a service.
Laying The Foundation Of Your Data Platform For The Era Of Big Complexity With Dagster

November 20th, 2021 | 1 hr 5 mins

An interview with Nick Schrock about how the Dagster framework is focusing on taming the complexity of data workflows, the introduction of Dagster Cloud for reducing the operational burden, and his philosophy on the boundaries for commercial and open source features going forward.
Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

November 20th, 2021 | 52 mins 53 secs

An interview with Ori Rafael about the benefits of streaming data integration for data lake analytics and how to design your pipelines when migrating from a batch oriented mindset.
Data Quality Starts At The Source

November 14th, 2021 | 58 mins 54 secs

An interview with Michael Harper about the benefits of being proactive about data quality efforts and building expectations and metrics into every stage of your pipelines, from source to destination.
Eliminate Friction In Your Data Platform Through Unified Metadata Using OpenMetadata

November 10th, 2021 | 1 hr 6 mins

An interview about the OpenMetadata project and how it can provide a universal metadata layer for your whole data environment through common schema definitions and a simple architecture
Business Intelligence Beyond The Dashboard With ClicData

November 6th, 2021 | 1 hr 2 mins

An interview with Telmo Silva about all of the layers involved in a full featured business intelligence system and how he created ClicData to make them available to organizations of every size.
Exploring The Evolution And Adoption of Customer Data Platforms and Reverse ETL

November 4th, 2021 | 1 hr 2 mins

An interview with Tejas Manohar of Hightouch and Rachel Bradley-Haas of Big-Time Data about how the growth of customer data platforms led to the introduction of reverse ETL systems and how you can use them together to improve your customer experience.
Removing The Barrier To Exploratory Analytics with Activity Schema and Narrator

October 29th, 2021 | 1 hr 8 mins

An interview with Ahmed Elsamadisi about how the Narrator platform uses the activity schema to make self service exploratory analytics a seamless experience.
Streaming Data Pipelines Made SQL With Decodable

October 28th, 2021 | 1 hr 9 mins

An interview with Eric Sammer about the difficulty of working with streaming engines at a low level of abstraction and how he and his team at Decodable are working to make development of streaming data pipelines as straightforward as writing SQL
Data Exploration For Business Users Powered By Analytics Engineering With Lightdash

October 22nd, 2021 | 1 hr 6 mins

An interview with Oliver Laslett about the open source Lightdash framework for business intelligence and how it builds on the work that your analytics engineers are doing with dbt.
Completing The Feedback Loop Of Data Through Operational Analytics With Census

October 20th, 2021 | 1 hr 9 mins

An interview with Boris Jabes of Census about the growing trend of operational analytics, how it allows data teams to complete the feedback loop for data value, and how the Census platform is architected to make it easy to implement.

← Previous
1
2
…
11
12
13
…
26
27
Next →

Data Engineering Podcast is © 2024 by Boundless Notions, LLC.

About
Episodes
Host
Contact
Search
Advertise
Be Our Guest
The Machine Learning Podcast
Subscribe